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Dedication of "A First Course in Electrical and Computer Engineering" 


Louis Scharf dedicates this book to 
his wife Carol, son Greg, and daughter Heidi, 
for love and inspiration; 
his parents Louis Sr. and Ann, 
in celebration of their 50th wedding anniversary. 


Richard Behrens dedicates this book to 
his wife Debbie, and child as yet unborn, 
for love and encouragement; 
his parents Richard and Elsie, 
in gratitude for a good education. 


Preface to "A First Course in Electrical and Computer Engineering" 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


This book was written for an experimental freshman course at the 
University of Colorado. The course is now an elective that the majority of 
our electrical and computer engineering students take in the second 
semester of their freshman year, just before their first circuits course. Our 
department decided to offer this course for several reasons: 


1. we wanted to pique student’ interest in engineering by acquainting 
them with engineering teachers early in their university careers and by 
providing with exposure to the types of problems that electrical and 
computer engineers are asked to solve; 

2. we wanted students entering the electrical and computer engineering 
programs to be prepared in complex analysis, phasors, and linear 
algebra, topics that are of fundamental importance in our discipline; 

3. we wanted students to have an introduction to a software application 
tool, such as MATLAB, to complete their preparation for practical and 
efficient computing in their subsequent courses and in their 
professional careers; 

4. we wanted students to make early contact with advanced topics like 
vector graphics, filtering, and binary coding so that they would gain a 
more rounded picture of modern electrical and computer engineering. 


In order to introduce this course, we had to sacrifice a second semester of 
Pascal programming. We concluded that the sacrifice was worth making 
because we found that most of our students were prepared for high-level 
language computing after just one semester of programming. 


We believe engineering educators elsewhere are reaching similar 
conclusions about their own students and curriculums. We hope this book 
helps create a much needed dialogue about curriculum revision and that it 
leads to the development of similar introductory courses that encourage 
students to enter and practice our craft. 


Students electing to take this course have completed one semester of 
calculus, computer programming, chemistry, and humanities. Concurrently 
with this course, students take physics and a second semester of calculus, as 
well as a second semester in the humanities. By omitting the advanced 
topics marked by asterisks, we are able to cover Complex Numbers through 
Linear Algebra, plus two of the three remaining chapters. The book is 
organized so that the instructor can select any two of the three. If every 
chapter of this book is covered, including the advanced topics, then enough 
material exists for a two-semester course. 


The first three chapters of this book provide a fairly complete coverage of 
complex numbers, the functions e* and e/, and phasors. Our department 
philosophy is that these topics must be understood if a student is to succeed 
in electrical and computer engineering. These three chapters may also be 
used as a supplement to a circuits course. A measured pace of presentation, 
taking between sixteen and eighteen lectures, is sufficient to cover all but 
the advanced sections in Complex Numbers through Phasors. 


The chapter on "linear algebra" is prerequisite for all subsequent chapters. 
We use eight to ten lectures to cover it. We devote twelve to sixteen lectures 
to cover topics from Vector Graphics through Binary Codes. (We assume a 
semester consisting of 42 lectures and three exams.) The chapter on vector 
graphics applies the linear algebra learned in the previous chapter to the 
problem of translating, scaling, and rotating images. ''Filtering" introduces 
the student to basic ideas in averaging and filtering. The chapter on "Binary 
Codes" covers the rudiments of binary coding, including Huffman codes 
and Hamming codes. 


If the users of this book find "Vector Graphics" through "Binary Codes" too 
confining, we encourage them to supplement the essential material in 
"Complex Numbers" through "Linear Algebra" with their own course notes 
on additional topics. Within electrical and computer engineering there are 


endless possibilities. Practically any set of topics that can be taught with 
conviction and enthusiasm will whet the student's appetite. We encourage 
you to write to us or to our editor, Tom Robbins, about your ideas for 
additional topics. We would like to think that our book and its subsequent 
editions will have an open architecture that enables us to accommodate a 
wide range of student and faculty interests. 


Throughout this book we have used MATLAB programs to illustrate key 
ideas. MATLAB is an interactive, matrix-oriented language that is ideally 
suited to circuit analysis, linear systems, control theory, communications, 
linear algebra, and numerical analysis. MATLAB is rapidly becoming a 
standard software tool in universities and engineering companies. (For more 
information about MATLAB, return the attached card in the back of this 
book to The MathWorks, Inc.) MATLAB programs are designed to develop 
the student's ability to solve meaningful problems, compute, and plot in a 
high-level applications language. Our students get started in MATLAB by 
working through “An Introduction to MATLAB,” while seated at an IBM 
PC (or look-alike) or an Apple Macintosh. We also have them run through 
the demonstration programs in "Complex Numbers". Each week we give 
three classroom lectures and conduct a one-hour computer lab session. 
Students use this lab session to hone MATLAB skills, to write programs, or 
to conduct the numerical experiments that are given at the end of each 
chapter. We require that these experiments be carried out and then reported 
in a short lab report that contains (i) introduction, (ii) analytical 
computations, (iii) computer code, (iv) experimental results, and (v) 
conclusions. The quality of the numerical results and the computer graphics 
astonishes students. Solutions to the chapter problems are available from 
the publisher for instructors who adopt this text for classroom use. 


We wish to acknowledge our late colleague Richard Roberts, who 
encouraged us to publish this book, and Michael Lightner and Ruth 
Ravenel, who taught ''Linear Algebra" and '"Vector Graphics" and offered 
helpful suggestions on the manuscript. We thank C. T. Mullis for allowing 
us to use his notes on binary codes to guide our writing of "Binary Codes". 
We thank Cédric Demeure and Peter Massey for their contributions to the 
writing of "An Introduction to MATLAB" and "The Edix Editor". We thank 
Tom Robbins, our editor at Addison-Wesley, for his encouragement, 


patience, and many suggestions. We are especially grateful to Julie 
Fredlund, who composed this text through many drafts and improved it in 
many ways. We thank her for preparing an excellent manuscript for 
production. 


L. L. Scharf 
R. T. Behrens 
Boulder, Colorado 


To the Teacher: 


An incomplete understanding of complex numbers and phasors handicaps 
students in circuits and electronics courses, and even more so in advanced 
courses such as electromagnetics. optics, linear systems, control, and 
communication systems. Our faculty has decided to address this problem as 
early as possible in the curriculum by designing a course that drills complex 
numbers and phasors into the minds of beginning engineering students. We 
have used power signals, musical tones, Lissajous figures, light scattering, 
and RLC circuits to illustrate the usefulness of phasor calculus. "Linear 
Algebra" through "Binary Codes" introduce students to a handful of modern 
ideas in electrical and computer engineering. The motivation is to whet 
students' appetites for more advanced problems. The topics we have chosen 
— linear algebra, vector graphics, filtering, and binary codes — are only 
representative. 


At first glance, many of the equations in this book look intimidating to 
beginning students. For this reason, we proceed at a very measured pace. In 
our lectures, we write out in agonizing detail every equation that involves a 
N-1 
sequence or series. For example, the sum ys z” is written out as 
n=0 


Equation: 


and then it is evaluated for some specific value of z before we derive the 
N 


=e 
analytical result a Similarly, an infinite sequence like 
=2 


lim (1 -- =) is written out as 
N—+00 n 


Equation: 


BAe eee ae 00 
1+2), (1 =) . (+=) — (1+—) ae 
oe Gea 3 100 


and then it is evaluated for some specific x and for several values of n 
before the limit is derived. We try to preserve this practice of pedantic 
excess until it is clear that every student is comfortable with an idea and the 
notation for coding the idea. 


To the Student: 


These are exciting times for electrical and computer engineering. To 
celebrate its silver anniversary, the National Academy of Engineering 
announced in February of 1990 the top ten engineering feats of the previous 
twenty-five years. The Apollo moon landing, a truly Olympian and protean 
achievement, ranked number one. However, a number of other 
achievements in the top ten were also readily identifiable as the products of 
electrical and computer engineers: 


1. communication and remote sensing satellites, 

2. the microprocessor, 

3. computer-aided design and manufacturing (CADCAM), 
4. computerized axial tomography (CAT scan), 

5. lasers, and 

6. fiber optic communication. 


AS engineering students, you recognize these achievements to be important 
milestones for humanity; you take pride in the role that engineers have 
played in the technological revolution of the twentieth century. 


So how do we harness your enthusiasm for the grand enterprise of 
engineering? Historically, we have enrolled you in a freshman curriculum 
of mathematics, science, and humanities. If you succeeded, we enrolled you 
in an engineering curriculum. We then taught you the details of your 
profession and encouraged your faith that what you were studying is what 
you must study to be creative and productive engineers. The longer your 
faith held, the more likely you were to complete your studies. This seems 
like an imperious approach to engineering education, even though 
mathematics, physics, and the humanities are the foundation of engineering, 
and details are what form the structure of engineering. It seems to us that a 
better way to stimulate your enthusiasm and encourage your faith is to 
introduce you early in your studies to engineering teachers who will share 
their insights about some of the fascinating advanced topics in engineering, 
while teaching you the mathematical and physical principles of engineering. 
But you must match the teacher's commitment with your own commitment 
to study. This means that you must attend lectures, read texts, and work 
problems. You must be inquisitive and skeptical. Ask yourself how an idea 
is limited in scope and how it might be extended to apply to a wider range 
of problems. For, after all, one of the great themes of engineering is that a 
few fundamental ideas from mathematics and science, coupled with a few 
principles of design, may be applied to a wide range of engineering 
problems. Good luck with your studies. 


Complex Numbers: Introduction 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Notes to Teachers and Students: 


When we teach complex numbers to beginning engineering students, we 
encourage a geometrical picture supported by an algebraic structure. Every 
algebraic manipulation carried out in a lecture is accompanied by a care- 
fully drawn picture in order to fix the idea that geometry and algebra go 
hand-in-glove to complete our understanding of complex numbers. We 
assign essentially every problem for homework. 


We use the MATLAB programs in this chapter to illustrate the theory of 
complex numbers and to develop skill with the MATLAB language. The 
numerical experiment introduces students to the basic quadratic equation of 
electrical and computer engineering and shows how the roots of this 
quadratic equation depend on the coefficients of the equation. 


for freshmen but easily accessible to sophomores. It may be covered for 
additional insight, skipped without consequence, or covered after Chapter 4. 
“An Electric Field Computation,” is well beyond most freshmen, and it is 
demanding for sophomores. Nonetheless, an expert in electromagnetics 
might want to cover the section "An electric Field Computation" for the 
insight it brings to the use of complex numbers for representing two- 
dimensional real quantities. 


Introduction 


It is hard to overestimate the value of complex numbers. They first arose in 
the study of roots for quadratic equations. But, as with so many other great 
discoveries, complex numbers have found widespread application well 
outside their original domain of discovery. They are now used throughout 
mathematics, applied science, and engineering to represent the harmonic 
nature of vibrating systems and oscillating fields. For example, complex 
numbers may be used to study 


i. traveling waves on a Sea surface; 
il. standing waves on a violin string; 
iii. the pure tone of a Kurzweil piano; 
iv. the acoustic field in a concert hall; 
v. the light of a He-Ne laser; 
vi. the electromagnetic field in a light show; 
vii. the vibrations in a robot arm; 
viii. the oscillations of a suspension system; 
ix. the carrier signal used to transmit AM or FM radio; 
x. the carrier signal used to transmit digital data over telephone lines; and 
xi. the 60 Hz signal used to deliver power to a home. 


In this chapter we develop the algebra and geometry of complex numbers. 
In Chapter 3 we will show how complex numbers are used to build phasor 
representations of power and communication signals. 


Complex Numbers: Geometry of Complex Numbers 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The most fundamental new idea in the study of complex numbers is the 
“imaginary number” 7. This imaginary number is defined to be the square 
root of —1: 

Equation: 


Equation: 


The imaginary number 7 is used to build complex numbers x and y in the 
following way: 
Equation: 


2 iy. 
We say that the complex number z has “real part” z and “imaginary part” y: 
Equation: 
z= Relz] + jIm[z| 
Equation: 


Re[z] = z; Im[z] = y. 


In MATLAB, the variable x is denoted by real (Zz), and the variable y is 
denoted by imag(Z ). In communication theory, x is called the “in-phase” 
component of z, and y is called the “quadrature” component. We call z = 
x + 7y the Cartesian representation of z, with real component x and 
imaginary component y. We say that the Cartesian pair (x, y)codes the 
complex number z. 


We may plot the complex number z on the plane as in [link]. We call the 
horizontal axis the “real axis” and the vertical axis the “imaginary axis.” 
The plane is called the “complex plane.” The radius and angle of the line to 
the point z = x + 7y are 

Equation: 


r=/xr+y 


6 =tan } (=). 


See [link]. In MATLAB, r is denoted by abs(Z), and @ is denoted by 
angle(z). 


Equation: 


imaginary axis (J) 


rsin@=Y Z- £+Jy-re® 


real axis 


T= cos 


Cartesian and Polar Representations of the Complex Number z 


The original Cartesian representation is obtained from the radius r and 
angle 6 as follows: 
Equation: 


x = rcos0 
Equation: 
y=r sin 0. 
The complex number z may therefore be written as 
Equation: 


a x+ jy 
= rcos@ + jrsin8 


= r (cos6@+ jsin 6). 


The complex number cos@ + jsind is, itself, a number that may be 
represented on the complex plane and coded with the Cartesian pair 

(cos 6, sin@). This is illustrated in [link]. The radius and angle to the point 
z =cos 0+ jsin@ are 1 and @. Can you see why? 


imaginary axis 


sin@ e 
cos@ +7sin@ = e° 


real axis 


cos@ 


The Complex Number cos @ + 7 sin 0 


The complex number cos@ + jsind is of such fundamental importance to 
our study of complex numbers that we give it the special symbol e”” : 
Equation: 

0 


e!” —cos 6 + jsind. 


As illustrated in [link], the complex number e)9 has radius 1 and angle 0. 
With the symbol e?’, we may write the complex number z as 
Equation: 


z— re”, 


We call z = re?” a polar representation for the complex number z. We say 
that the polar pair 7 Z@codes the complex number z. In this polar 
representation, we define |z| = r to be the magnitude of z and arg (z) = 0 
to be the angle, or phase, of z: 

Equation: 


jz] =r 
Equation: 
arg (z) = 80. 
With these definitions of magnitude and phase, we can write the complex 


number z as 
Equation: 


z= |zlei8), 


Let's summarize our ways of writing the complex number z and record the 
corresponding geometric codes: 
Equation: 


z= at+jy = re® = Jz|ef rel), 


+ + 
(x, y) rZ@0 


In "Roots of Quadratic Equations" we show that the definition 

e® — cos0 + jsin@ is more than symbolic. We show, in fact, that ce) is just 
the familiar function e® evaluated at the imaginary argument x = j@. We 
call e/? a “complex exponential,” meaning that it is an exponential with an 
imaginary argument. 

Exercise: 


Problem: 


Prove (j)°" = (—1)” and (j)?""* = (—1)"j. Evaluate j°, 74, 7°. 


Exercise: 


Problem: 
Prove eil(/2)+m2n] _ 5 eil(3n/2)+m2n] _ 5 @i(0+m2n) _ 1 and 
ej("+m2r) — _1, Plot these identities on the complex plane. (Assume 
m is an integer.) 

Exercise: 
Problem: 


Find the polar representation z = re?” for each of the following 
complex numbers: 


a. 2=> 1+ 90; 
b2=0-+ 71; 
cz=1-+4 1; 
d.g==-1L— 9L 


Plot the points on the complex plane. 


Exercise: 


Problem: 


Find the Cartesian representation z = x + jy for each of the following 
complex numbers: 


a= Vv 2¢i"/? ; 
b= J/2e7/4; 
c. z= efsa/4; 


d.z = V2e87/2. 
Plot the points on the complex plane. 
Exercise: 
Problem: 


The following geometric codes represent complex numbers. Decode 
each by writing down the corresponding complex number z: 


a. (0.7,-0.1) z=? 
b. (160-5). 2= 
G1h627/8 2=7 
d.0.4271/8 z=? 


Exercise: 


Problem: 


Show that Im[jz] = Re[z] and Re|—jz| = Im[z]. Demo 1.1 
(MATLAB). Run the following MATLAB program in order to 
compute and plot the complex number e/? for 

6 = i2n/360,7 = 1,2,...,360: j=sqrt(-1) n=360 for 
1=1:n, circle(1)=exp(j*2*pi*i/n);end; 
axis('square') plot(circle) Replace the explicit for loop 
of line 3 by the implicit loop circle=exp(j*2*pi*[1:n]/n); 
to speed up the calculation. You can see from [link] that the complex 
number e, evaluated at angles 6 = 27/360, 2(27/360), ..., turns out 
complex numbers that lie at angle and radius 1. We say that e7? is a 
complex number that lies on the unit circle. We will have much more 
to say about the unit circle in Chapter 2. 


-1 -0.5 0 0.5 1 


The Complex Numbers e/* for 0 < 0 < 27 (Demo 1.1) 


Complex Numbers: Algebra of Complex Numbers 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The complex numbers form a mathematical “field” on which the usual 
operations of addition and multiplication are defined. Each of these 
operations has a simple geometric interpretation. 


Addition and Multiplication. 
The complex numbers 2, and Z2 are 


added according to the rule 
Equation: 


Zz+2. = ("14+ Jy1) + (v2 + jy2) 
= (1+ 22)+ j(yi + y2)- 


We say that the real parts add and the imaginary parts add. As illustrated in 
[link], the complex number z; + z2 is computed from a “parallelogram 
rule,” wherein 2; + Z2 lies on the node of a parallelogram formed from z1 
and Z9. 

Exercise: 


Problem: 
Let z; = re" and z) = rje”. Find a polar formula z3 =r3e!3 for 


23 = 21 + 22 that involves only the variables r;, r2, 01, and 02. The 
formula for rz is the “law of cosines.” 


The product of z; and zp is 
Equation: 


Z122 = (x1 + jy1)(@2 + Jy2) 
= (x12 = Y1Y2) =F j(yix2 ae £1Y2). 


Adding Complex Numbers 


If the polar representations for z; and Z2 are used, then the product may be 
written as [footnote | 

We have used the trigonometric identities cos (0; + 62) =cos 0; cos @2— 
sin 6, sin 8 and sin (6; + 02) =sin 0; cos 62+ cos 9; sin 0 to derive this 
result. 

Equation: 


Zig3 = rer poet 


| 


(r1 cos 0; + jr, sin 61)(r2 cos 02 + jrg sin 62) 
= (11 cos 0r2 cos 62 —7, sin O1r2 sin 62) 

+ j( ry sin O1r2 cos 62+ 711 cos O17r2 sin 02) 
= rr cos (0; + 82) + jrire sin (0; + 02) 


= ryryed(Ar+ 2) | 


We say that the magnitudes multiply and the angles add. As illustrated in 
[link], the product 212 lies at the angle (0; + 62). 


Multiplying Complex Numbers 


Rotation. There is a special case of complex multiplication that will become 
very important in our study of phasors in the chapter on Phasors. When 2 
is the complex number z, = r,e2% and zz is the complex number 

Zz. = e/*, then the product of z; and 29 is 


Equation: 


2129 = 2109 = ry e914), 


As illustrated in [link], z1 Z2 is just a rotation of z, through the angle Oo. 


Zere 


Rotation of Complex Numbers 


Exercise: 


Problem: 


Begin with the complex number z; = x + jy = reJ”. Compute the 
complex number z) = 72, in its Cartesian and polar forms. The 
complex number z2 is sometimes called perp(z1). Explain why by 
writing perp(z1) as 21 eJ”2, What is 92? Repeat this problem for 

£3 = —JZ1. 


Powers. If the complex number z; multiplies itself NV times, then the result 
is 
Equation: 


N ,jNO1 


N 
) =; 


(21 


This result may be proved with a simple induction argument. Assume 


“ = ree, (The assumption is true for k = 1.) Then use the recursion 


rae =.= itt edb +1) 61 Iterate this recursion (or induction) until 
k +1 = N. Can you see that, as n ranges from n = 1,..., N, the angle of 
z§ranges from 0, to 20;, ..., to N@, and the radius ranges from r, to ee er 


to re ? This result is explored more fully in Problem 1.19. 


Complex Conjugate. Corresponding to every complex number z = 
a + jy = re” is the complex conjugate 
Equation: 


z=@2- jy =re . 
The complex number z and its complex conjugate are illustrated in [link]. 


The recipe for finding complex conjugates is to “change jto — 7. This 
changes the sign of the imaginary part of the complex number. 


imaginary axis 


real axis 


A Complex Variable and Its Complex Conjugate 


Magnitude Squared. The product of z and its complex conjugate is called 
the magnitude squared of z and is denoted by ||” : 
Equation: 

jz|? =z z= (¢— jy) (a2@4+ jy) =a? +y? = re re® =r’. 


Note that |z| = r is the radius, or magnitude, that we defined in "Geometry 
of Complex Numbers". 
Exercise: 


Problem: 
Write z as z= zw. Find w in its Cartesian and polar forms. 


Exercise: 


Problem: Prove that angle (z2z1) = 0) — 0}. 


Exercise: 


Problem: 


Show that the real and imaginary parts of z = x + jzy may be written 
as 
Equation: 


Equation: 


Im[z]=2j z-z . 


Commutativity, Associativity, and Distributivity. The complex numbers 
commute, associate, and distribute under addition and multiplication as 
follows: 

Equation: 


Z4+2 = 2+21 
2122 a 2221 


Equation: 


(21 ae 22) gay = 2a (zo at 23) 
21(2223) = (2122) z3 
zi(z2 +23) = 2122+ 2123. 


Identities and Inverses. In the field of complex numbers, the complex 
number 0 + 70 (denoted by 0) plays the role of an additive identity, and the 
complex number 1 + 70 (denoted by 1) plays the role of a multiplicative 
identity: 

Equation: 


ZO! = = 0+2 
a S22: wiz. 

In this field, the complex number —z = —x + j(—y) is the additive inverse 

ae x ; —Y : 
of z, and the complex number z ee —— isthe 

P. 2 A y J 2 a y? 
multiplicative inverse: 
Equation: 
z+(-z) = 0 

zz 

Exercise: 
Problem: 


Show that the additive inverse of z = re?” may be written as reJ(9+7), 


Exercise: 


Problem: Show that the multiplicative inverse of z may be written as 
Equation: 


~__ (a — jy) 
20 =z = —— > (a — jy). 
ZZ x? + y? 1 


1 


Show that z zis real. Show that z~ may also be written as 


Equation: 
zgi=rte®, 


Plot z and z~! for a representative z. 


Exercise: 


Problem: Prove (j) © = —J. 


Exercise: 


Problem: Find z~! when z = 1+ jl. 
Exercise: 
Problem: 
Prove (21) —~ 2g =p le — 3 Plot z and (2-1) for a 
representative z. -_ 
Exercise: 
Problem: 


Find all of the complex numbers z with the property that jz = —z’. 
Illustrate these complex numbers on the complex plane. 


Demo 1.2 (MATLAB). Create and run the following script file (name it 
Complex Numbers)[footnote | 
If you are using PC-MATLAB, you will need to name your file cmplxnos.m. 


clear, clg 

j=sqrt(-1) 

Z1=1+j* .5,Z2=2+j)*1.5 

Z23=21+Z2, Z4=Z1* 22 

z5=conj(z1),zZ6=j*z2 

avis([-4 4 -4 4]),axis('square'),plot(z1, '0') 
hold on 
plot(z2,'0'),plot(z3,'+'),plot(z4,'*'), 
plot(z2,'0'),plot(z3,'+'),plot(z4,'*'), 
plot(z5,'x'),plot(z6, 'x') 


-2 0 v4 4 


Complex Numbers (Demo 1.2) 


With the help of Appendix 1, you should be able to annotate each line of 
this program. View your graphics display to verify the rules for add, 
multiply, conjugate, and perp. See [link]. 

Exercise: 


Problem: Prove that z? = 1. 


Exercise: 


Problem: 


(MATLAB) Choose z; = 1.05e/27/"6 and z2 = 0.95e/27/1®, Write a 
MATLAB program to compute and plot z} and z5 forn = 1, 2,..., 32. 
You should observe a figure like [link]. 


Powers of z 


Complex Numbers: Roots of Quadratic Equations 


You probably first encountered complex numbers when you studied values 
of z (called roots or zeros) for which the following equation is satisfied: 
Equation: 


az? +bz+c=0. 


For a # 0 (as we will assume), this equation may be written as 
Equation: 


b 
P+ 224—=0. 
a a 


Let's denote the second-degree polynomial on the left-hand side of this 
equation by p(z): 
Equation: 


This is called a monic polynomial because the coefficient of the highest- 
power term (2?) is 1. When looking for solutions to the quadratic equation 


z+ eas = 0, we are really looking for roots (or zeros) of the 


sokaiornial ey The fundamental theorem of algebra says that there are 
two such roots. When we have found them, we may factor the polynomial 
p(z) as follows: 

Equation: 


In this equation, z; and z» are the roots we seek. The factored form p(z) = 
(z — 21) (z — Z2) shows clearly that p(z1) = p(z2) = 0, meaning that the 


quadratic equation p(z) = 0 is solved for z = z; and z = 22. In the process 
of factoring the polynomial p(z), we solve the quadratic equation and vice 
versa. 


By equating the coefficients of z”, z’, and z? on the left-and right-hand 
sides of [link], we find that the sum and the product of the roots z; and z2 
obey the equations 

Equation: 


Ae SO 


= Cc 
4142 OG 


You should always check your solutions with these equations. 


Completing the Square. In order to solve the quadratic equation 


Cc 
z* + —z+— =0 (or, equivalently, to find the roots of the polynomial 
a a 


b 
ae ee <) , we “complete the square” on the left-hand side of [link]: 
a a 


pat) fl OY ae 2g 
2a 2a a 


This equation may be rewritten as 
Equation: 


Equation: 


We may take the square root of each side to find the solutions 
Equation: 


b 1 
Bi + oa b2 — 4ac. 


a 


Exercise: 


Problem: 


With the roots z, and 2, defined in Equation 1.29, prove that 


b 
(z — z1) (z — 22) is, indeed, equal to the polynomial z” + Pe os. 


b Cc 
Check that z1 + z2 = —— and z,z2 = — 
a a 
In the equation that defines the roots z; and z2, the term b? — 4ac is critical 
because it determines the nature of the solutions for z; and 22. In fact, we 


may define three classes of solutions depending on b? — 4ac. 


(i) Overdamped (b? — 4ac > 0). In this case, the roots z; and Z2 are 
Equation: 


These two roots are real, and they are located symmetrically about the point 


b 
aa When 6 = 0, they are located symmetrically about 0 at the points 
a 


1 ee 

a V—Aac. (In this case, —4ac > 0.) Typical solutions are illustrated in 
a 

[link]. 


Typical Roots in the Overdamped Case; (a) 


b/2a > 0, 4ac > 0, (b) b/2a > 0, 4ac < 0, and 
(c) b/2a = 0, 4ac < 0 


Exercise: 


Problem: 


Compute and plot the roots of the following quadratic equations: 


1 
a z2+2z+—=0; 


i 
be 4 2a— 2 = (): 
1 
9 
.2—-—=0. 
C2 5 
b Cc 
For each equation, check that 2; + z2 = — . and 2122 = a 


(ii) Critically Damped (0 — 4ac = 0). In this case, the roots z1 and Z2 
are equal (we say they are repeated): 
Equation: 


Roots in the Critically Damped Case; (a) 
b/2a > 0, and (b) b/2a < 0 


Exercise: 


Problem: 
Compute and plot the roots of the following quadratic equations: 
a2 4224 1=0 


b. 22 —-2z+1=0:; 
c. 227 =0. 


b Ee 
For each equation, check that z1 + 22 = —— and z1z2 = — 
a a 


(iii) Underdamped (0 — 4ac < 0). The underdamped case is, by far, the 
most fascinating case. When b2 — dac < 0, then the square root in the 
solutions for z, and 2» ({link]) produces an imaginary number. We may 
write b? — dac as — (4ac = b?) and write 21,2 as 

Equation: 


Z2 = os + 3 /—(4ac — b?) 


= -£ + j¢v4ac — b?. 


These complex roots are illustrated in [link]. Note that the roots are 


purely imaginary when b = 0, producing the result 


Equation: 
fe 
21,2 = if © 
a 


i 2 
2a 4ac-h 


@ 


1 
aav4ac- b? 


Figure 1.12: Typical Roots in the Underdamped Case; (a) 
b/2a > 0, (b)b/2a < 0, and (c) b/2a = 0 


In this underdamped case, 


Equation: 


Thus the polynomial p (z) 


takes the form 
Equation: 


p(z) 


the roots z; and zz are complex conjugates: 


* 


22 21> 


b 
Pn a = (z— 21) (z — 22) also 
a a 


(2-21) (2-2) 


z? —2Re [zz + lz’. 


Re[z1] and z1|? are related to the original coefficients of the polynomial as 


follows: 
Equation: 


b 
2 Re [zi] = ee 


Equation: 


Always check these equations. 


Let's explore these connections further by using the polar representations 
for z, and 22: 
Equation: 


ral ie 10 
fio =yeo. 


Then [link] for the polynomial p(z) may be written in the “standard form” 
Equation: 


(z — re?) (z — re) 


27 —2r cos 6z+r". 


p(z) 


[link] is now 
Equation: 


2rcos9 = ——= 


r? = 
a 


These equations may be used to locate z1,.2 = ret? 


Equation: 
= 
— ye 


at —b 
+ cos (& ): 


S 
| 


Exercise: 


Problem: 
Prove that p(z) may be written as p(z) = z? — 2r cos 0z + r? in the 
underdamped case. 

Exercise: 


Problem: 


Prove the relations in [link]. Outline a graphical procedure for locating 


j a . Cc 
z1 = re and z) = re ”” from the polynomial Ee —, 
a a 


Exercise: 


Problem: 
Compute and plot the roots of the following quadratic equations: 


a. 27+2z+2=0; 
b. 22 —2z+2=0: 
E24 2=0. 


b 2 
For each equation, check that 2 Re [z1,2] = —— and | 
a 


21,2 


Complex Numbers: Representing Complex Numbers in a Vector Space 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
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So far we have coded the complex number z = x + jy with the Cartesian 
pair (a, y) and with the polar pair (rZ0). We now show how the complex 
number z may be coded with a two-dimensional vector z and show how this 
new code may be used to gain insight about complex numbers. 

Coding a Complex Number as a Vector. We code the complex number 


. ° ° ° x 
z= «+ 7y with the two-dimensional vector z = | 


Equation: 


: x 
etqyyH2ea= r 
y 


We plot this vector as in [link]. We say that the vector z belongs to a 
“vector space.” This means that vectors may be added and scaled according 
to the rules 

Equation: 


XL ie 


Zj+rZ2 = 
; + Y2 


Equation: 


IL 


The Vector z Coding the Complex Number z 


Furthermore, it means that an additive inverse —z, an additive identity O, 
and a multiplicative identity 1 all exist: 
Equation: 


Equation: 


0 
The vector 0 is 0 = i ; 


Prove that vector addition and scalar multiplication satisfy these properties 
of commutation, association, and distribution: 
Equation: 


Z1 + Z2 = Z2+ 21 


Equation: 


(2, + 22) + 23 = 21 + (Zo + 23) 
Equation: 


a(bz) 


(ab)z 
Equation: 
a(z, + 22) = az, + azo. 
Inner Product and Norm. The inner product between two vectors z; and 
Z» is defined to be the real number 


Equation: 


(Z1,Z2) = 122 + yiyp. 


We sometimes write this inner product as the vector product (more on this 


in Linear Algebra) 
Equation: 
(z1,Z2) = ZZ 
X2 
= |%1 YI = | %1%2 + Y1Y2 }- 
y¥2 
Exercise: 


Problem: Prove (Z1, Z2) = (Z, Z1). 


When Zz; = Z2 = Z, then the inner product between z and itself is the norm 
squared of z: 
Equation: 


These properties of vectors seem abstract. However, aS we now show, they 
may be used to develop a vector calculus for doing complex arithmetic. 


A Vector Calculus for Complex Arithmetic. The addition of two complex 
numbers z; and z2 corresponds to the addition of the vectors z, and Za : 
Equation: 


t1+ X92 
Z4+228%,+2Z2 = 
yi + Y2 


The scalar multiplication of the complex number z2 by the real number x, 
corresponds to scalar multiplication of the vector z» by x1 : 
Equation: 


belies Peed 
W122 9 X = ‘ 
Y2 L1Y2 


Similarly, the multiplication of the complex number Z2 by the real number 
y1 is 
Equation: 


£2 Y1r2 
Y122 <7 n| = | r 
Yy2 Y1Yy2 


The complex product 21z2 = (x1 + jy1)Z2 is therefore represented as 
Equation: 


ee = a 
2122 > : 
L1Y2 + YiL2 


This representation may be written as the inner product 
Equation: 


ae 


2122 = 2921 hae 
» 41 


z 

where v and w are the vectors v = | : | and w = [al . By defining the 
—Y2 L2 

matrix 

Equation: 


* a 
Yy2 wr 


we can represent the complex product 2,22 as a matrix-vector multiply 
(more on this in Linear Algebra): 
Equation: 


* cy a 
2122 = 2921 : 
Y2 £2 Y1 


With this representation, we can represent rotation as 
Equation: 


zel? — ez 45 


cos @ —sin | | 


sin@ cos@ XL 


cos @ — sin @ 


We call the matrix | | a “rotation matrix.” 


sin@ cos@ 
Exercise: 


Problem: Call R(@) the rotation matrix: 
Equation: 


sin@ cos@ 


R (6) = ae ae 


Show that R(—@) rotates by (—@). What can you say about R(—0)w 
when w = R(0)z? 


Exercise: 


Problem: Represent the complex conjugate of z as 


Equation: 
* a a b 4 0 
‘ c d| ly 


and find the elements a, b,c, and d of the matrix. 


Inner Product and Polar Representation. From the norm of a vector, we 
derive a formula for the magnitude of z in the polar representation z = re” 


Equation: 


r= at ty ? = |[z|| = (2,2). 


il 0 
If we define the coordinate vectors e; = A and ey = A , then we can 


represent the vector z as 
Equation: 


Z = (Z,e1)e; + (Z, e2)eo. 
See [link]. From the figure it is clear that the cosine and sine of the angle 0 


are 
Equation: 


Representation of z in its Natural Basis 


This gives us another representation for any vector z: 
Equation: 


z = ||z|| cos Oe; + ||z|| sin Beg. 


The inner product between two vectors 2; and Z2 is now 
Equation: 


T le He 


(Z1, 22) = (z1,e1)e7 (z1, 2 )e5 (22, €2)e2 


(Z1, €1) (Z2, 1) + (21, €2) (Ze, €2) 
||Z1|| cos 4, ||z2|| cos 62 + ||z1|| sin 41 ||Z2 || sin Oo. 


It follows that cos (82 — 01) =cos 02 cos 6;+ sin 6; sin 62 may be 
written as 
Equation: 


cos (0) — 6;) = (21,22) 


[laa |} [ee II 


This formula shows that the cosine of the angle between two vectors z1 and 


* 
Z2, which is, of course, the cosine of the angle of 222, is the ratio of the 
inner product to the norms. 
Exercise: 


Problem: 


Prove the Schwarz and triangle inequalities and interpret them: 
Equation: 


(Schwarz) (z1, 22)” < ||za||? ||z2||? 
Equation: 


(triangle) I z; — 22 || < ||z1 —z3||+||z2— 23 ||. 
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We have established that vectors may be used to code complex numbers. 
Conversely, complex numbers may be used to code or represent the 
orthogonal components of any two-dimensional vector. This makes them 
invaluable in electromagnetic field theory, where they are used to represent 
the components of electric and magnetic fields. 


The basic problem in electromagnetic field theory is to determine the 
electric or magnetic field that is generated by a static or dynamic 
distribution of charge. The key idea is to isolate an infinitesimal charge, 
determine the field set up by this charge, and then to sum the fields 
contributed by all such infinitesimal charges. This idea is illustrated in 
[link], where the charge A, uniformly distributed over a line segment of 
length dx at point —2, produces a field dE(~) at the test point (0, A). The 
field dE (x) is a “vector” field (as opposed to a “scalar” field), with 
components £ (a) and F2 (x). The intensity or field strength of the field 
dE() is 

Equation: 


Adx 


os Arey (h? + x?) 


Infinitesimal Charge Adx Producing Field dE (x) 


But the field strength is directed at angle 0(), as illustrated in [link]. The 
field dE (x) is real with components dE; (x) and dE (x), but we code it 
as a complex field. We say that the “complex” field at test point (0, A) is 
Equation: 


Adx 
dE a) 
(2) Arey (h? + 2?) : 


with components dF, (x) and dE (x). That is, 
Equation: 


dE (x) = dE; (x) + jdE2 (x) 


Equation: 


Adz 
dE = —________. 0 
1 (2) Ameég (h? + 2?) ee) 
Equation: 
Ad 
dE» (x) a sin 0(x). 


~ Amen (h? + 2?) 


For charge uniformally distributed with density A along the x-axis, the total 
field at the test point (0, A) is obtained by integrating dE: 
Equation: 


if. i / © <> __ [eos 0 (a) + jaind (2)]dz. 


a oo Amey (h? + x?) 


The functions cos 6(z) and sin 0(x) are 
Equation: 


cos 0(x) = 


We leave it as a problem to show that the real component F; of the field is 
zero. The imaginary component £7 is 


Equation: 
_[? Ah dx 
B=jh-j/ 7-———, 
—_o« “7€9 (x? +h?) 
Equation: 
Ah x ™ 
a, 


ATE9 h2(22 + pn2)l? 


Equation: 


_. AR 1 x 1  . A 
~ I are h2 sh? ~ 4 Oneoh 
Equation: 
DN 
Ey = : 
2regh 


We emphasize that the field at (0, h) is a real field. Our imaginary answer 
simply says that the real field is oriented in the vertical direction because 
we have used the imaginary part of the complex field to code the vertical 
component of the real field. 

Exercise: 


Problem: 


Show that the horizontal component of the field F is zero. Interpret 
this finding physically. 


From the symmetry of this problem, we conclude that the field around the 
infinitely long wire of [link] is radially symmetric. So, in polar coordinates, 
we could say 

Equation: 


mA 


= 2TE0r 


which is independent of 6. If we integrated the field along a radial line 
perpendicular to the wire, we would measure the voltage difference 
Equation: 


V(r) -V (ro) =[ 


aN a 
r= — logr,—logro . 
QTE Or 2TEy 


An electric field has units of volts/meter, a charge density \ has units of 
coulombs/meter, and €g has units of coulombs/volt-meter; voltage has units 
of volts (of course). 
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There is a version of the quadratic equation that will arise over and over 
again in your study of electrical and mechanical systems: 
Equation: 


s? + 2wos + we = 0. 


For reasons that can only become clear as you continue your study of 
engineering, the parameter wy is called a resonant frequency, and the 
parameter € > 0 is called a damping factor. In this experiment, you will 
begin by 


1. finding the “underdamped” range of values € > 0 for which the roots 
S; and Sz are complex; 

2. finding the “critically damped” value of € > 0 that makes the roots sj 
and Ss» equal; and 

3. finding the “overdamped” range of values € > O for which s, and s» 
are real. 

4. For each of these ranges, find the analytical solution for 5; as a 
function of @g and €; write your solutions in Cartesian and polar forms 
and present your results as 
Equation: 


OSES 


o1.2-— € =€ 
yee Ge 


where ¢, is the critically damped value of €. Write a MATLAB 
program that computes and plots 81,9 for Wg fixed at wo = 1 and € 
variable between 0.0 and 2.0 in steps of 0.1. Interpret all of your 
findings. 


Now organize the coefficients of the polynomial s? + 2&s + 1 into the 
array [12€1]. Imbed the MATLAB instructions 


r=roots([1 2*e 1]); 
plot(real(r(1)),imag(r(1)),'o') 
plot(real(r(2)),imag(r(2)),'o') 


in a for loop to compute and plot the roots of s? + 2s + 1 as € ranges 
from 0.0 to 2.0. Note that r is a 1 x 2 array of complex numbers. You 
should observe [link]. We call this “half circle and line” the locus of roots 
for the quadratic equation or the “root locus” in shorthand. 


Roots of Quadratic Equaton 
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Notes to Teachers and Students 


It is essential to write out, term-by-term, every sequence and sum in this 
chapter. This demystifies the seemingly mysterious notation. The example 
on compound interest shows the value of limiting arguments in everyday 
life and gives e” some real meaning. The function e/*, covered in the 
section "The Function of e!® and the Unit Circle and "Numerical 
proceeding to ''Phasors" . The Euler and De Moivre identities provide every 
tool that students need to derive trigonometric formulas. The properties of 
roots of unity are invaluable for the study of phasors in !"Phasors".. 


The MATLAB programs in this chapter are used to illustrate sequences and 
series and to explore approximations to sin @ and cos @. The numerical 


geometrically and algebraically, how approximations to e”” converge. 


“Second-Order Differential and Difference Equations” is a little demanding 
for freshmen, but we give it a once-over-lightly to illustrate the power of 
quadratic equations and the functions e* and e””. This section also gives a 
sneak preview of more advanced courses in circuits and systems. 


Introduction 


It is probably not too strong a statement to say that the function e” is the 
most important function in engineering and applied science. In this chapter 
we study the function e” and extend its definition to the function e””. This 
study clarifies our definition of e”” from "Complex Numbers" and leads us 
to an investigation of sequences and series. We use the function eto 
derive the Euler and De Moivre identities and to produce a number of 
important trigonometric identities. We define the complex roots of unity 
and study their partial sums. The results of this chapter will be used in 
"Phasors" when we study the phasor representation of sinusoidal signals. 
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Many of you know the number e as the base of the natural logarithm, which 
has the value 2.718281828459045. ... What you may not know is that this 
number is actually defined as the limit of a sequence of approximating 
numbers. That is, 

Equation: 


Equation: 


This means, simply, that the sequence of numbers 

ie L\? 
(1+ 1) (1 + 3) : (: + 3) ,..., gets arbitrarily close to 
2.718281828459045. ... But why should such a sequence of numbers be so 
important? In the next several paragraphs we answer this question. 
Exercise: 


Problem: 


(MATLAB) Write a MATLAB program to evaluate the expression 
1 n 

n= (1 + =) forn = 1, 2,4, 8, 16, 32, 64 to show that f, + e 
n 


for large n. 


1 n 
Derivatives and the Number e. The number f,, = (: + =) arises in 
n 


the study of derivatives in the following way. Consider the function 
Equation: 


(ea, a1 


and ask yourself when the derivative of f(x) equals f(x). The function 


f(a) is plotted in [link] for a > 1. The slope of the function at point z is 
Equation: 


df(x) , qtt+Ar_gz 
= lim 
dx Az—>0 Az 
. Az _1 
= a” lim 
Az—>0 Ax 


The Function f (x) = a® 


If there is a special value for a such that 
Equation: 


Az 
aW~” —] 

li SSS 
es Ax 4 


d 
then es f (x) would equal f(x). We call this value of a the special (or 


exceptional) number e and write 
Equation: 


The number e would then be e = f(1). Let's write our condition that 
aft —1 
Ax 
Equation: 


converges to 1 as 


e(* _12 Az, Azsmall 


or as 
Equation: 


e (1+ Ax)“. 


i 1 
Our definition of e = lim (: + =| amounts to defining Ax = — 


and allowing n —> oo in order to make Ax —> 0. With this definition for e, 


it is clear that the function e* is defined to be (e)” 


Equation: 
e* =lim (1+ Az)’. 


Az—0 


z - 
By letting Az = — we can write this definition in the more familiar form 
n 


Equation: 


in (1+ =)" 
n 


n—- Oo 


This is our fundamental definition for the function e*. When evaluated at 
x = 1, it produces the definition of e given in [link]. 


The derivative of e* is, of course, 
Equation: 


This means that Taylor's theorem[footnote] may be used to find another 
characterization for e” : 

Taylor's theorem says that a function may be completely characterized by 
all of its derivatives (provided they all exist). 

Equation: 


et 


| 
7 8 
(an) 
| a | 
Q 
ass 
fav) 
8 
8 | a | 
3.8 
iy 
S|K 
Q 
3 


When this series expansion for e” is evaluated at x = 1, it produces the 
following series for e: 
Equation: 


“1 
=e ar 


n=0 


In this formula, n! is the product n(n — 1)(n — 2)---(2)1. Read n! as "n 
factorial.” 
Exercise: 


Problem: (MATLAB) Write a MATLAB program to evaluate the sum 
Equation: 


for N = 1, 2,4, 8, 16, 32, 64 to show that Sy = e for large NV. 
Compare S¢4 with f¢4 from [link]. Which approximation do you 
prefer? 


Compound Interest and the Function @*. There is an example from your 
everyday life that shows even more dramatically how the function e” arises. 
Suppose you invest Vo dollars in a savings account that offers 1002% 
annual interest. (When z = 0.01, this is 1%; when x = 0. 10, this is 10% 
interest.) If interest is compounded only once per year, you have the simple 
interest formula for V;, the value of your savings account after 1 compound 
(in this case, 1 year): 


V, = (14+ 2)Vo. This result is illustrated in the block diagram of Figure 
2.2(a). In this diagram, your input fortune Vo is processed by the “interest 
block” to produce your output fortune V;,. If interest is compounded 
monthly, then the annual interest is divided into 12 equal parts and applied 
12 times. The compounding formula for V7», the value of your savings after 
12 compounds (also 1 year) is 

Equation: 


Ve= (1 # >) A 


This result is illustrated in [link]. Can you read the block diagram? The 
general formula for the value of an account that is compounded n times per 
year is 

Equation: 


v= (1 ns =)'Vo. 


V,, is the value of your account after n compounds in a year, when the 
annual interest rate is 100x%. 


Block Diagram for Interest Computations; (a) Simple Annual Interest, 
and (b) Monthly Compounding 


= = 2 Zi2 
Vio = eo, = cre B12 Vig coe = (1+2)'7Y 


Exercise: 


Problem: 


Verify in [link] that a recursion is at work that terminates at V,,. That 
x 

is, Show that Vi. = (1 + =\y, forz = 0,1,...,n — 1 produces the 
n 


result V,, = (1 - =) Vo. 
n 


Bankers have discovered the (apparent) appeal of infinite, or continuous, 
compounding: 
Equation: 


n-> Co 


V., =lim (1 Es =) "Vo. 
n 


We know that this is just 
Equation: 


Voo = eV. 


So, when deciding between 100z,; % interest compounded daily and 
100x2% interest compounded continuously, we need only compare 
Equation: 


365 
(1 + <5) versus e*?. 


We suggest that daily compounding is about as good as continuous 
compounding. What do you think? How about monthly compounding? 
Exercise: 


Problem: 


(MATLAB) Write a MATLAB program to compute and plot simple 
interest, monthly interest, daily interest, and continuous interest versus 
interest rate 100z. Use the curves to develop a strategy for saving 
money. 
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Let's try to extend our definitions of the function e” to the argument x = 70. 
Then e/ is the function 
Equation: 


. Q 
ec —lim (: +i] 
n—-0o n 


6 
The complex number 1 + 7— is illustrated in [link]. The radius to the point 
n 


7 Ba ( 
L+-9— 187 = (: + =r and the angle is ¢ =tan~' — This means that 
n n n 


th . 6 : n 6g? ye 
the n”" power of 1 + 7— has radius r” = (1 + £) and angle 
n 
0 
np=n tan’ — (Recall our study of powers of z.) Therefore the complex 


4) n 
number (: +7 ~| may be written as 
n 


Equation: 


6\" g2\"? 1O6\ 1 8 
ee = eer cos {| n tan as + jsin | n tan I 


92 \"/? 6 
For n large, (1 -- i 2~1,andntan | — & n— = 6. Therefore 
n n 


= | 


Pe ae ; 
1+ j—]_ is approximately 
n 
Equation: 
.0\" a 
1+j—] =1{cos#+j7sin@}. 
n 


(: tie] = 1(cos 0+ 7 sin 6). 
n 


This finding is consistent with our previous definition of e7° ! 


The Complex Number 1 + jt 


The series expansion for e/9 is obtained by evaluating Taylor's formula at 
C= 90: 
Equation: 


: de n 
ef? — » ay io) 


When this series expansion for e/? is written out, we have the formula 
Equation: 


= eae) + i Ge 


fore) —1)" n : ie,0) iy n 
= aa a 6? 19D ean en. 


It is now clear that cos @ and sin @ have the series expansions 
Equation: 


oo (—1)” 2n 
cos 9 = ae On)! 7 


Equation: 
sin 0= ee eS 
2. n+ iy 


When these infinite sums are truncated at N — 1, then we say that we have N- 
term approximations for cos @ and sin 0: 


Equation: 
_1 (-1)" 
cos 9 = Nt ge 
Equation: 
N-1 n 
(-1)" pent 
sin 9 = paar 
Ds (2n + 1)! 


The ten-term approximations to cos @ and sin @ are plotted over exact 
expressions for cos @ and sin @ in [link]. The approximations are very good 
over one period (0 < @ < 2zr), but they diverge outside this interval. For more 
accurate approximations over a larger range of 6’s, we would need to use more 
terms. Or, better yet, we could use the fact that cos 6 and sin @ are periodic in 0 
. Then we could subtract as many multiples of 27 as we needed from @ to bring 
the result into the range [0, 27] and use the ten-term approximations on this new 
variable. The new variable is called 0-modulo 27. 


sine and cosine 


angle in radians 


Ten-Term Approximations to cos @ and sin 8 


Exercise: 


Problem: 


Write out the first several terms in the series expansions for cos @ and 
sin 0. 


Demo 2.1 (MATLAB). Create a MATLAB file containing the following demo 
MATLAB program that computes and plots two cycles of cos @ and sin 0 
versus 8. You should observe [link]. Note that two cycles take in 2(27) radians, 
which is approximately 12 radians. 


clg; 

j = sqrt(-1); 

theta = 0:2*pi/50:4*pi; 

S = Sin(theta); 

c = cos(theta); 
plot(theta,s); 
elabel('theta in radians'); 
ylabel('sine and cosine'); 
hold on 

plot(theta,c); 

hold off 


2 0.5 
2 
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co 
2 
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angle in radians 
The Functions cos @ and sin 0 
Exercise: 
Problem: 


(MATLAB) Write a MATLAB program to compute and plot the ten-term 
approximations to cos @ and sin @ for @ running from 0 to 2(277) in steps 
of 27/50. Compute and overplot exact expressions for cos @ and sin 6. 
You should observe a result like [link]. 


The Unit Circle. The unit circle is defined to be the set of all complex numbers 
z whose magnitudes are 1. This means that all the numbers on the unit circle 
may be written as z = ec? We say that the unit circle consists of all numbers 
generated by the function z = e” as 0 varies from 0 to 27. See [link]. 


A Fundamental Symmetry. Let's consider the two complex numbers z; and 


1 
—, illustrated in [link]. We call —-the “reflection of z through the unit circle” 
Ai ail 
1 1 ; 
(and vice versa). Note that z; = rye" and == —e) The complex 


Zi sie 
numbers z; — e/? and = — e?® are illustrated in [link]. The magnitude squared 
a1 
of each is 
Equation: 


Equation: 


* 
cal 


— eI? 


2 
(2-2) 
z, a 


The ratio of these magnitudes squared is 
Equation: 


(z1 = el”) (z; = e 7) 


G3) (b= 
zy #1 


This ratio may be manipulated to show that it is independent of 0, meaning that 


B= 


the points z; and —~ maintain a constant relative distance from every point on 
seal 

the unit circle: 

Equation: 


e/? (ez, —1) (z,e—1) e 9 


a (l-etaiane wy 


x = 


= |z,|*, independent of6! 


This result will be of paramount importance to you when you study digital 
filtering, antenna design, and communication theory. 


unit circle 


The Unit Circle 


Exercise: 


Problem: Write the complex number z — e9 as rei’, What are r and yp? 
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The Euler and De Moivre identities are the fundamental identities for 
deriving trigonometric formulas. From the identity e7” =cos 9 + jsin 6 and 


: ; * 
the conjugate identity e~J? = (e”) =cos 9 — jsin 6, we have the Euler 
identities for cos @ and sin 6: 


Equation: 
_ e®@+~e—# 
cos9 = 5 
. _ e0 —jo 
sin@d = 2; 


These identities are illustrated in [link]. 


Ij 


eJ 


Euler's Identities 


The identity e39 =cos 6 + j sin @ also produces the De Moivre identity: 
Equation: 


(cos0+j sin 0)" =  (e9)" = ein? 


cos né + j sin nd 


When the left-hand side of this equation is expanded with the binomial 
expansion, we obtain the identity 
Equation: 


S° & (cos 9)” *(j sin g)* =cos n@+ j sin no. 


Binomial Coefficients and Pascal's Triangle. The binomial coefficients 
Cy in [link] are shorthand for the number 


Equation: 


n\ ni k—-01 
‘Gamer ee ee 


This number gives the coefficient of z”~*y* in the expansion of (x + y)”. 


How do we know that there are ‘. terms of the form 2” "y*? One way to 


answer this question is to use Pascal's triangle, illustrated in [link]. Each 
node on Pascal's triangle shows the number of routes that terminate at that 
node. This number is always the sum of the number of routes that terminate 
at the nodes just above the node in question. If we think of a left-hand path 
as an occurrence of an x and a right-hand path as an occurrence of a y, then 
we aa Pascal's triangle keeps track of the number of occurrences of 

7 eam ae 


Pascal's Triangle and the Binomial Coefficients 


Exercise: 


Problem: Prove (7) 


| 
——™ 
Vo 
= 
eee 


Exercise: 


Problem: Find an identity for aes, + 7 


Exercise: 


Problem: Find “half-angle” formulas for cos 26 and sin 26. 
Exercise: 
Problem: Show that 


A. cos 36 =cos2 6 — 3cos Osin 260; 
B. sin 30 = 34 cos 26sin 0— sin? 6. 


Exercise: 
Problem: 


Use eJ(9+%) — ¢%1¢I — (cos 6; + j sin 0) (cos 02 + j sin 62) to 
prove 


a. cos (8; + 62) =cos 9; cos 62 sin 0; sin 9, ; 
b. sin (6; + 02) =sin 0; cos 02+ sin 62 cos 6}. 


The Functions e/x and e/j60: Roots of Unity and Related Topics 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The complex number z = e/27/N ic illustrated in [link]. It lies on the unit 
circle at angle 9 = 27/N. When this number is raised to the n*" power, the 
result is 2” = e/2*"/N This number is also illustrated in [link]. When one 
of the complex numbers e/?7”/% is raised to the N*" power, the result is 
Equation: 


2an/N\* i2 
ae) =e" 1, 


The Complex Numbers e/27/% and e/277/ 


We say that e/27”/ is one of the N*™ roots of unity, meaning that e727"/% 


is one of the values of z for which 
Equation: 


2-1-0. 


There are N such roots, namely, 
Equation: 


ePm/N n= 0,1,...,N—1. 


As illustrated in [link], the 12*" roots of unity are uniformly distributed 
around the unit circle at angles 27/12. The sum of all of the N™ roots of 
unity is zero: 

Equation: 


N-1 


Sy = see = (), 


n=0 


This property, which is obvious from [link], is illustrated in [link], where 


k-1 
the partial sums S$; = >» e?™/N are plotted for k = 1,2,..., N. 
n=0 


j2m 3/12 
j2m 4/12 j2m 2/12 
Cc 
j2m 5/12 senile 
e e 
ence 29 0/12 j 212/12 
726/12 @" a e? 
27/12 j2mli/i2 
a € 
| 20 8/12 j2mids/i2 
¢ € 
j2m 9/12 
€ 
Roots of Unity 


These partial sums will become important to us in our study of phasors and 
light diffraction in ""Phasors" and in our discussion of filters in "Filtering". 


tr 


P| 
-4 


Co 


Partial Sums of the Roots of Unity 


Geometric Sum Formula. It is natural to ask whether there is an analytical 
expression for the partial sums of roots of unity: 
Equation: 


k-1 
Si = S ere, 
n=0 


We can imbed this question in the more general question, is there an 
analytical solution for the “geometric sum” 
Equation: 


The answer is yes, and here is how we find it. If z = 1, the answer is 
S;, = k. If z 4 1, we can premultiply S; by z and proceed as follows: 
Equation: 


See ye a 
= - Seah 
Set 2* — 1. 


From this formula we solve for the geometric sum: 
Equation: 


This basic formula for the geometric sum S; is used throughout 
electromagnetic theory and system theory to solve problems in antenna 
design and spectrum analysis. Never forget it. 


Exercise: 
k-1 k-1 
Problem: Find formulas for S$; = ) e”9 and for S$, = ) gen Nt 
Exercise: 


NA 
Problem: Prove be ef2m/N — Q) 


n=0 


Exercise: 


Problem: 


Find formulas for the magnitude and phase of the partial sum 
k—-1 


Ss. = ys ejann/N- 


n=0 
Exercise: 


Problem: 
(MATLAB) Write a MATLAB program to compute and plot the partial 


k-1 
sumn.S,°= S° eP™IN tor k = 1,2,..., N. You should observe [link]. 


n=0 


Exercise: 


Problem: Solve the equation (z + 1)° = z°. 


Exercise: 


Problem: Find all roots of the equation z* + 27 + 3z—15 =0. 
Exercise: 


Problem: 


Find c so that (1 + 7) is a root of the equation z1’ + 224° —-c=0. 
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With our understanding of the functions e”, eI? and the quadratic equation 
22 + tz + — = 0, we can undertake a rudimentary study of differential 
and difference equations. 

Differential Equations. In your study of circuits and systems you will 


encounter the homogeneous differential equation 
Equation: 


d2 


d 
Geo (t) +Qa,—2£ (t) +a, = 0. 


dt 


Because the function e* reproduces itself under differentiation, it is 
plausible to assume that x () = e* is a solution to the differential 
equation. Let's try it: 

Equation: 


£. (e%) + ais (e*) + a (e*) =0 
(s? + ays + ag)e™ =0. 


If this equation is to be satisfied for all t, then the polynomial in s must be 
zero. Therefore we require 
Equation: 


s*+ajs+a,=0. 


As we know from our study of this quadratic equation, the solutions are 
Equation: 


This means that our assumed solution works, provided s = sj or S9. Itisa 
fundamental result from the theory of differential equations that the most 
general solution for x(t) is a linear combination of these assumed solutions: 
Equation: 


a(t) = A,e* + Ane®. 


If a? — Adz is less than zero, then the roots s; and s2 are complex: 
Equation: 


at ot ) 
$19 > + 55 [401 — ay. 


Let's rewrite this solution as 
Equation: 


$12 = C0 + qw 
where o and w are the constants 


Equation: 


Equation: 


1 
w= 5 \V4a2 —a?. 


With this notation, the solution for «(t) is 
Equation: 


x (t) = Aye“ e 4 Ase eI. 


If this solution is to be real, then the two terms on the right-hand side must 
be complex conjugates. This means that Ay = A, and the solution for x(t) 
is 

Equation: 


. * 
A,e%eit zit A,e%e Ju 


2Re{ Aje“e™}. 


x (t) 


The constant A; may be written as A; = |Ale/”. Then the solution for x(t) 
is 
Equation: 


z(t) = 2|Ale™ cos (wt + y). 


This “damped cosinusoidal solution” is illustrated in [link]. 


The Solution to a Second-Order Differential Equation 


Exercise: 


Problem: 


Find the general solutions to the following differential equations: 


d? d 
a. —, X(t) +22 (t)+2=0; 
b ae 
dt? dt = 
d 


Difference Equations. In your study of digital filters you will encounter 
homogeneous difference equations of the form 
Equation: 


Ln + A1Lyn_-1 + A22n_2 = O. 


What this means is that the sequence{x,, } obeys a homogeneous recursion: 
Equation: 


Ln = —A1%n-1 — A27n-2. 


A plausible guess at a solution is the geometric sequence 7, = z”. With 
this guess, the difference equation produces the result 
Equation: 

2 +az2™! +a,z”"7 =0 


(1+ ayz t+ aye a" =0 


If this guess is to work, then the second-order polynomial on the left-hand 
side must equal zero: 


1l+ajz++aqz77 =0 
Equation: 


1+ajz-1+a9z>7 =0 


2 +ajz+az=0. 


The solutions are 
Equation: 


= a «1 2 
A a ae + j24/4a. — ay 


The general solution to the difference equation is a linear combination of 
the assumed solutions: 
Equation: 


Ln Aj,2z + Ag (z)" 
= Aizit+Aj,(a)" 
= 2 Re{Aiz7} 

= 2\Alr” cos (@n+ yy). 


This general solution is illustrated in [link]. 


2*abs(A)*(r.An) 


The Solution to a Second-Order Difference Equation 


Exercise: 


Problem: 
Find the general solutions to the following difference equations: 
a Ln + 2¢n-1 +2 = 0; 


b. 2, — 24n_-1 +2 = 0; 
CH, 22,5 = 0. 
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We have demonstrated that the function e”” has two representations: 


n— Oo n 
2. ec? — lim (39) 
n—0o k! 


In this experiment, you will write a MATLAB program to evaluate the two 
functions f,, and S;,, for twenty values of n: 


if= (1 +4 #) n= 1,2,..., 20; and 
2. Sp = 3x2, GH in = 1,2,..., 20k 
~YOn k=0 EF? 9 Hy ery é 


Choose 0 = 7/4(= pi/4). Use an implicit For loop to draw and plot a 
circle of radius 1. Then use an implicit for loop to compute and plot f,, and 
an explicit for loop to compute and plot S, forn = 1, 2,...,100. You 
should observe plots like those illustrated in [link]. Interpret them. 


Imaginary 


Imaginary 


0 0.2 0.4 0.6 
Real 


Plots for Convergence;(a) of f;, and (b) of S;, 
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Notes to Teachers and Students: 


Phasors! For those who understand them, they are of incomparable value 
for the study of elementary and advanced topics. For those who 
misunderstand them, they are a constant source of confusion and are of no 
apparent use. So, let's understand them. 

The conceptual leap from the complex number e” to the phasor e/ ert) 
comes in "'Phasor Representation of Signals”. Take as long as necessary to 
understand every geometrical and algebraic nuance. Write the MATLAB 
program in "Exercise 6" to fix the key ideas once and for all. Then use 
phasors to study beating between tones, multiphase power, and Lissajous 
figures in "Beating between Tones" through "Lissajous Figures". We 
usually conduct a classroom demonstration of beating between tones using 
two phase-locked sources, an oscilloscope, and a speaker. We also 
demonstrate Lissajous figures with this hardware. 


"Sinusoidal Steady State and the Series RLC Circuit" and Light Scattering 
by_a Slit" on sinusoidal steady state and light scattering are too demanding 
for freshmen but are right on target for sophomores. These sections may be 
covered in a sophomore course (or a supplement to a sophomore course) or 
skipped in a freshman course without consequence. 


In the numerical experiment in ‘Numerical Experiment (Interference 
Patterns)", students compute and plot interference patterns for two sinusoids 
that are out of phase. 


Introduction 


In engineering and applied science, three test signals form the basis for our 
study of electrical and mechanical systems. The impulse is an idealized 
signal that models very short excitations (like current pulses, hammer 
blows, pile drives, and light flashes). The step is an idealized signal that 
models excitations that are switched on and stay on (like current in a relay 
that closes or a transistor that switches). The sinusoid is an idealized signal 
that models excitations that oscillate with a regular frequency (like AC 
power, AM radio, pure musical tones, and harmonic vibrations). All three 
signals are used in the laboratory to design and analyze electrical and 
mechanical circuits, control systems, radio antennas, and the like. The 
sinusoidal signal is particularly important because it may be used to 
determine the frequency selectivity of a circuit (like a superheterodyne 
radio receiver) to excitations of different frequencies. For this reason, every 
manufacturer of electronics test equipment builds sinusoidal oscillators that 
may be swept through many octaves of Orequency. (Hewlett-Packard was 
started in 1940 with the famous HP audio oscillator.) 


In this chapter we use what we have learned about complex numbers and 
the function e/” to develop a phasor calculus for representing and ma- 
nipulating sinusoids. This calculus operates very much like the calculus we 
developed in "Complex Numbers" and "The Functions e* and e!®" for 
manipulating complex numbers. We ap- ply our calculus to the study of 
beating phenomena, multiphase power, series RLC circuits, and light 
scattering by a slit. 
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There are two key ideas behind the phasor representation of a signal: 


i. a real, time-varying signal may be represented by a complex, time- 
varying signal; and 

ii. a complex, time-varying signal may be represented as the product of a 
complex number that is independent of time and a complex signal that 
is dependent on time. 


Let's be concrete. The signal 
Equation: 


z(t) = Acos (wt + ), 


illustrated in [link], is a cosinusoidal signal with amplitude A, frequency o, 
and phase @. The amplitude A characterizes the peak-to-peak swing of 2.A, 


the angular frequency w characterizes the period 7’ = cali between 
negative- to-positive zero crossings (or positive peaks ep iegace peaks), 
and the phase @ characterizes the time T = — when the signal reaches its 
first peak. With t so defined, the signal x(t) aay also be written as 


Equation: 


z(t) = A cos w(t — 7). 


Acos(@t+@) versus t 


(pees 
Dae 


A Cosinusoidal Signal 


When Tt is positive, then T is a “time delay” that describes the time (greater 
than zero) when the first peak is achieved. When T is negative, then Tis a 
“time advance” that describes the time (less than zero) when the last peak 


27 
was achieved. With the substitution w = ra we obtain a third way of 


2(+) = Acos (er) 


In this form the signal is easy to plot. Simply draw a cosinusoidal wave 
with amplitude A and period T; then strike the origin (¢ = 0) so that the 
signal reaches its peak at t. In summary, the parameters that determine a 
cosinusoidal signal have the following units: 


writing x(t): 
Equation: 


A, arbitrary (e.g., volts or meters/sec, depending upon the application) 


@, in radians/sec (rad/sec) 
T, in seconds (sec) 
@, in radians (rad) 
T, in seconds (sec) 


Exercise: 


Problem: 


2 
Show that x (t) = A cos = (t — T) is “periodic with period T’," 
meaning that z(t + mT’) = x(t) for all integer m. 
Exercise: 


Problem: 


The inverse of the period 7’ is called the “temporal frequency” of the 
cosinusoidal signal and is given the symbol f; the units of f = 7 are 


(seconds) * or hertz (Hz). Write x(t) in terms of f. How is f related 
to w? Explain why f gives the number of cycles of x(t) per second. 


Exercise: 
Problem: 
Sketch the function x (t) = 110 cos [an (60)¢ — = versus t. Repeat 


for x (t) = 5 cos ln (16 x 10°)t + *] and 


8 
A,w,T, f, y, and r. Label your sketches carefully. 


2 10? 
x(t) = 2 cos =. (: _ )| . For each function, determine 


The signal z(t) = A cos (wt + y) can be represented as the real part of a 
complex number: 
Equation: 


a(t) = Re [Ae%*+)] 
Re|Ae/?ei"] . 


We call Ae/¥e/" the complex representation of x(t) and write 
Equation: 


z(t) + Aei?e™, 


meaning that the signal x(t) may be reconstructed by taking the real part of 
Ae/?eJ*, In this representation, we call AeJ? the phasor or complex 
amplitude representation of x(t) and write 

Equation: 


a(t) + Ae’®, 


meaning that the signal x(t) may be reconstructed from Ae/? by 
multiplying with e%* and taking the real part. In communication theory, we 
call AeJ? the baseband representation of the signal x(t). 

Exercise: 


Problem: 


For each of the signals in Problem 3.3, give the corresponding phasor 
representation Ae/”. 


Geometric Interpretation. Let's call 
Equation: 


Ae?” ei! 
the complex representation of the real signal A cos (wt + vy). Att = 0, the 


complex representation produces the phasor 
Equation: 


Ae!” 


This phasor is illustrated in [link]. In the figure, @ is approximately oT If 


we let ¢ increase to time t,, then the complex representation produces the 
phasor 


AeI? Jt | 


eon 
". 


~ Aer%eiot, (t= i) 


’ 


eta oe “hei? emt; (t= t,) 


Rotating Phasor 


We know from our study of complex numbers that e/“*1 just rotates the 
phasor Ae/’’ through an angle of wt, ! See [link]. Therefore, as we run t 
from 0, indefinitely, we rotate the phasor Ae?” indefinitely, turning out the 


2 
circular trajectory of [link]. When t = ail then et = eJ2" — J, 


an 
Therefore, every (=) seconds, the phasor revisits any given position on 
W 
the circle of radius A. We sometimes call Ae??ei4 
rotation rate is the frequency w: 
Equation: 


a rotating phasor whose 


This rotation rate is also the frequency of the cosinusoidal signal 
A cos (wt + y). 


In summary, Ae/’e* is the complex, or rotating phasor, representation of 
the signal A cos (wt + y). In this representation, e/“* rotates the phasor 
Ae/? through angles wt at the rate w. The real part of the complex 
representation is the desired signal Acos (wt + wy). This real part is read off 
the rotating phasor diagram as illustrated in [link]. In the figure, the angle @ 


Tv . 
is about — To As we become more facile with phasor representations, we 


will write x (t) = Re [x eit) and call Xe“ the complex representation 


and X the phasor representation. The phasor X is, of course, just the phasor 
Ae/?. 


Reading a Real Signal from a Complex, 
Rotating Phasor 


Exercise: 
Problem: 
Sketch the imaginary part of Ae”e%” to show that this is 


A sin (wt + y). What do we mean when we say that the real and 
imaginary parts of Ae??eI are "90° out of phase"? 


Exercise: 


Problem: 


(MATLAB) Modify Demo 2.1 in "The Function e* and e!" so that 

9 = wt, with w an input frequency variable and ¢ a time variable that 
27 on \ . 27 

ranges from —2{ —— }]to + 2{ — ] insteps of 0.02 {| — ]. In your 
W W Ww 

modified program, compute and plot e*, Re [e*], and Im[e*] for 


2 2 2 , 
22 (=) <t=2 (=) in steps of 0.02 (=). Io el ina 
w Ww w 


two-dimensional plot to get a picture like [link] and plot Re lem*| and 
Im [e™*| versus ¢ to get signals like those of [link]. You should 


observe something like [link] using the subplot features discussed in 
An Introduction to MATLAB. (In the figure, w represents Greek w.) 


exp(jwt) 


The Functions e*, Re[e*|, and Im|e”*] 


Positive and Negative Frequencies. There is an alternative phasor 
representation for the signal 7(¢) = A cos (wt + w). We obtain it by using 


1 


the Euler formula of "The Function eX ifr namely, cos 0 = 2 (e%” + on) 
. When this formula is applied to x(t), we obtain the result 


Equation: 


£ (t) 4 [edwet+y) = e iut+¢)| 


= 4 elPejut -++ Ze We jut, 


A =; 
In this formula, the term a ee)" is a rotating phasor that begins at the 
phasor value Ae’? (for t = 0) and rotates counterclockwise with frequency 


w. The term 7 e /’e is a rotating phasor that begins at the (complex 


A. 
conjugate) phasor value > e /? (for t = 0) and rotates clockwise with 


(negative) ffequency w. The physically meaningful frequency for a cosine 
is @, a positive number like 27(60) for 60 Hz power. There is no such thing 
as a negative frequency. The so-called negative frequency of the term 


A 2.3 
ae J? e~J*" just indicates that the direction of rotation for the rotating 


phasor is clock-wise and not counterclockwise. The notion of a negative 
frequency is just an artifact of the two-phasor representation of A 

cos (wt + w). In the one-phasor representation, when we take the “real 
part,” the artifact does not arise. In your study of circuits, systems theory, 
electromagnetics, solid-state devices, signal processing, control, and 
communications, you will encounter both the one- and two-phasor 
representations. Become facile with them. 

Exercise: 


Problem: 


Sketch the two-phasor representation of A cos (wt + w). Show clearly 
how this representation works by discussing the counterclockwise 
rotation of the positive frequency part and the clockwise rotation of the 
negative frequency part. 


Adding Phasors. The sum of two signals with common frequencies but 
different amplitudes and phases is 
Equation: 


A, cos (wt + 1) + Ag cos (wt + yo). 
The rotating phasor representation for this sum is 
Equation: 
(Aje?1 a Aze!??) el", 
The new phasor is A,e/*! + Aye4?2, and the corresponding real signal is 


a(t) = Re[(Aje!*! + Ape”2) e*]. The new phasor is illustrated in 
[link]. 


Adding Phasors 


Exercise: 


Problem: 


Write the phasor A,e/?! + AyeJ?2 as Aze/?3 ; determine A3 and v3 in 
terms of A;, Az, Yi, and yz. What is the corresponding real signal? 


Differentiating and Integrating Phasors. The derivative of the signal 
A cos (wt + y) is the signal 
Equation: 
A cos(wt+y) = —wA sin (wt + y) 
= —Im|wAe%e| 
= Re | jwAel? elt) 


= Re [wei"/? AeWeit| 


This finding is very important. It says that the derivative of A cos (wt + y) 
has the phasor representation 
Equation: 

+ A cos (wt + ) & iwAel? 

dt ? J 


és west? fei, 


These two phasor representations are entirely equivalent. The first says that 
the phasor Ae?” is complex scaled by jw to produce the phasor for 


oA cos (wt+w), and the second says that it is scaled by w and phased by 
+1 /2. The phasor representations of A cos (wt + wy) and 

oA cos (wt + vy) are illustrated in Figure 3.6. Note that the derivative 
“leads by 2/2 radians (90)." 


Differentiating and Integrating Phasors 


The integral of A cos (wt + y) is 


Equation: 
f Acos (wt+ y)dt = 4 sin (wt + y) 
= m[A el? elt] 
= Re[—j4e¥ei*] 


= Re [4 eieit| 
= Re[te%/? Acie"), 
This finding shows that the integral of A cos (wt + ) has the phasor 


representation 
Equation: 


fAcos(wt+yp)dt o Fg Ae 


oe Lett fel”. 


| 1 
The phasor Ae/” is complex scaled by —— or scaled by — and phased by 
qw W 


e J™/2 to produce the phasor for rl A cos («1 + °) dt. This is illustrated 


in [link]. Note that the integral “lags by 7/2 radians (90). Keep these 
geometrical pictures of leading and lagging by 7/2 in your mind at all 
times as you continue your more advanced study of engineering. 


An Aside: The Harmonic Oscillator. The signal A cos (wt + y) stands 
on its own as an interesting signal. But the fact that it reproduces itself (with 
scaling and phasing) under differentiation means that it obeys the second- 
order differential equation of the simple harmonic oscillator.[footnote] That 
is, the differential equation 

This means, also, that we have an easy way to synthesize cosines with 
circuits that obey the equation of a simple harmonic oscillator! 

Equation: 


dz (t) 
dt? 


+ wax (t) =0 


has the solution 


Equation: 
x(t) = A cos (wt + y). 
Try it: 
Equation: 
d? d 
z(t) = —[—Aw sin (wt + y)] = —w*A cos (wt + ). 


di.’ dt 


The constants A and @ are determined from the initial conditions 
Equation: 


xz (0) = Acos y x” (0)+ 27 (#) =A’ 
=> 
a(<)=-Asingy ae =tan ~ 
Exercise: 
Problem: 


Show how to compute A and y in the equation 
x(t) = A cos (wt + y) from the initial conditions x(O) and 


d 
ae (t) |s—o- 


Phasors: Beating between Tones 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Perhaps you have heard two slightly mistuned musical instruments play 
pure tones whose frequencies are close but not equal. If so, you have sensed 
a beating phenomenon wherein a pure tone seems to wax and wane. This 
waxing and waning tone is, in fact, a tone whose frequency is the average 
of the two mismatched frequencies, amplitude modulated by a tone whose 
“beat” frequency is half the difference between the two mismatched 
frequencies. The effect is illustrated in [link]. Let's see if we can derive a 
mathematical model for the beating of tones. 


We begin with two pure tones whose frequencies are Wg + v and wo — V 
(for example, wo = 27 x 10°rad/ sec and v = 27 rad / sec). The 
average frequency is @ , and the difference frequency is 2v. What you hear 
is the sum of the two tones: 

Equation: 


a(t) = A, cos [(wo + v)t + pi] + Ag cos [(wo — v)t + vol. 


The first tone has amplitude A; and phase (yj ; the second has amplitude 
Az and phase (2. We will assume that the two amplitudes are equal to A. 
Furthermore, whatever the phases, we may write them as 

Equation: 


Yi=y+y and ~o=y~-wH 
y=Fs(~ity2) and p= $(y1— ¥2). 


Ml 
a Wi 


Beating between Tones 


Recall our trick for representing x(t) as a complex phasor: 
Equation: 
= 2A Re ef(t+#) cos, (vt + ) 
= 2A cos (wot + y) cos (vt+ yx). 


This is an amplitude modulated wave, wherein a low frequency signal with 
beat frequency v rad/sec modulates a high frequency signal with carrier 
frequency wo rad/sec. Over short periods of time, the modulating term 

cos (vt+~) remains essentially constant while the carrier term 

cos (wot + y) turns out many cycles of its tone. For example, if ¢ runs 


2 
from 0 to nm (about 0.1 seconds in our example), then the modulating 
V 


wave turns out just 1/10 cycle while the carrier turns out LOvw , cycles 
(about 100 in our example). Every time vt changes by 27 radians, then the 
modulating term goes from a maximum (a wax) through a minimum (a 
wane) and back to a maximum. This cycle takes 

Equation: 


2 
iy em ** seconds, 
V 


which is 1 second in our example. In this 1 second the carrier turns out 
1000 cycles. 
Exercise: 


Problem: 


Find out the frequency of A above middle Con a piano. Assume two 
pianos are mistuned by +1Hz(+2zrad/sec). Find their beat 
frequency v and their carrier frequency wo. 


Exercise: 


Problem: (MATLAB) Write a MATLAB program to compute and plot 


A cos |(wp + v)t + yi], A cos [(wo — v)t + Yo], and their sum. Then 
compute and plot 2A cos (wot + wy) cos (vt + y). 


Verify that the sum equals this latter signal. 


Phasors: Multiphase Power 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The electrical service to your home is a two-phase service.| footnote] This 
means that two 110 volt, 60 Hz lines, plus neutral, terminate in the panel. 
The lines are 7 radians (180) out of phase, so we can write them as 

It really is, although it is said to be “single phase” because of the way it is 
picked off a single phase of a primary source. You will hear more about this 
in circuits and power courses. 


Equation: 
a(t) = 110 cos [27 (60)t+y] = Re {110el?7(60)e+e] } 
an Re{ X,e27(60}t} 
X, = 110e” 
z2(t) = 110 cos [27(60)t+y+7| = Re £110e92"(60)ete +a] 
- Re{ Xpe227(60)t} 


Xo = 110e%¥+"), 


These two voltages are illustrated as the phasors X, and X> in [Link]. 


Phasors in Two-Phase Power 


You may use 2 (t) to drive your clock radio or your toaster and the 
difference between 2 (t) and x2 (t) to drive your range or dryer: 
Equation: 


x1 (t) — x2 (t) = 220 cos [27(60)t + vy). 
The phasor representation of this difference is 
Equation: 
X,— X>_ = 220e”. 
The breakers in a breaker box span the 21-to-neutral bus for 110 volts and 


the 11-to-x» buses for 220 volts. 
Exercise: 


Problem: Sketch the phasor X; — X 2 on [link]. 


Most industrial installations use a three-phase service consisting of the 
signals x, (t), £2 (t), and 23 (t): 
Equation: 


tn (t) = 110Re ‘Cage 4 Xp = 110e"?7/9), n = 1,2,3. 


The phasors for three-phase power are illustrated in [link]. 
Exercise: 

Problem: 

Sketch the phasor X2 — Xj corresponding to x2 (t) — x (¢) on 

[link]. Compute the voltage you can get with x (t) — x, (t). This 


answer explains why you do not get 220 volts in three-phase circuits. 
What do you get? 


Xx, 


Three-Phase Power 


Constant Power. Two- and three-phase power generalizes in an obvious 
way to N-phase power. In such a scheme, the NV signals x,(n = 
0,1,...,M —1) are 

Equation: 


2 
Ln (t) A cos G + rn) 


— Re [Ae2nr/N eit) is X= Ae?2tm/N | 


The phasors X,, are Ae?2"("/") The sum of all NV signals is zero: 
Equation: 


ma tn(t) = Re{ADncg e?™/Nemt} 


1-e?" jut 
Re{ A ae \ 


= 0. 


But what about the sum of the instantaneous powers? Define the 
instantaneous power of the n* signal to be 
Equation: 


Ss 
3 
— 
oe 
—” 
| 
8 
3h 
en 
oe 
—” 
| 


A? cos? (wt + tn) 
= a + A cos (Quit a 227 n) 


- zy Re { A ein/ Nemes | 


The sum of all instantaneous powers is (see [link]) 
Equation: 


N-1 A2 
> Pn lt) =N—, 
n=0 


and this is independent of time! 
Exercise: 


Problem: 


Carry out the computations of Equation 3.33 to prove that 
instantaneous power P is constant in the N-phase power scheme. 


Phasors: Lissajous Figures 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Lissajous figures are figures that are turned out on the face of an 
oscilloscope when sinusoidal signals with different amplitudes and different 
phases are applied to the time base (real axis) and deflection plate 
(imaginary axis) of the scope. The electron beam that strikes the 
phosphorous face then had position 

Equation: 


z(t) = Az cos (wt + pz) + jAy cos (wt + yy). 


In this representation, A, cos (wt + y,) is the “x-coordinate of the point,” 
and A, cos (wt + ) is the “y-coordinate of the point.” As time runs from 
0 to infinity, the point z(¢) turns out a trajectory like that of [link]. The 


Z 
figure keeps overwriting itself because z(t) repeats itself every = 
Ww 


seconds. Do you see why? 


IM) xf) 


Lissajous Figure on Oscilloscope Screen 


Exercise: 
Problem: 
Find the intercepts that the Lissajous figure makes with the real and 


imaginary axes in [link]. At what values of time are these intercepts 
made? 


Exercise: 


Problem: 


Show that the Lissajous figure z(t) = A, cos (wt + y,)+ 
j Ay cos (wt + yy) is just the rotating phasor Ae’“**+*) when 


T 
Az = Ay = A, Go = p, and py = 9 + 5. 


Two-Phasor Representation. We gain insight into the shape of the 
Lissajous figure if we use Euler's formulas to write z(t) as follows: 
Equation: 


z(t) = Ae ellettys) + -suttya) + je el(wtt+py) 4. e-i(wt+py) 
AgeiPr+jAyel?y yg | Age iAyeHY ju 
_ Se Se 


This representation is illustrated in [link]. It consists of two rotating 
phasors, with respective phasors B, and Bz : 


Equation: 
z(t) = Bye + Boe 
_ Azei?t + 7A yey 
BL= —— 
AzeI?@ 4 jA,eIPY 
By = —._ a 
B, 
: -Ayo-IP 
javeity  |jque Ny 


Sa este 


as eI Pax 


Two-Phasor Representation of a Lissajous Figure 


As t increases, the phasors rotate past each other where they constructively 
add to produce large excursions of z(t) from the origin, and then they rotate 
to antipodal positions where they destructively add to produce near 
approaches of z(t) to the origin. 


In electromagnetics and optics, the representations of z(t) given in [link] 
and [link] are called, respectively, linear and circular representations of 
elliptical polarization. In the linear representation, the x- and y-components 
of z vary along the horizontal and vertical lines. In the circular 
representation, two phasors rotate in opposite directions to turn out circular 
trajectories whose sum produces the same effect. 

Exercise: 


Problem: 


(MATLAB) Write a MATLAB program to compute and plot the 
Lissajous figure z(t) when A, = 1/2, A, = 1, yz = 0, and 

(y = 7/6. Discretize t appropriately and choose an appropriate range 
of values for t. 


Phasors: Sinusoidal Steady State and the Series RLC Circuit 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Phasors may be used to analyze the behavior of electrical and mechanical 
systems that have reached a kind of equilibrium called sinusoidal steady 
state. In the sinusoidal steady state, every voltage and current (or force and 
velocity) in a system is sinusoidal with angular frequency w. However, the 
amplitudes and phases of these sinusoidal voltages and currents are all 
different. For example, the voltage across a resistor might lead the voltage 


1 
across a Capacitor by 90° ( — radians) and lag the voltage across an 


inductor by 90° (= radians) 


In order to make our application of phasors to electrical systems concrete, 
we consider the series RLC circuit illustrated in [link]. The arrow labeled 
i(t) denotes a current that flows in response to the voltage applied,and the + 
and - on the voltage source indicate that the polarity of the applied voltage 
is positive on the top and negative on the bottom. Our convention is that 
current flows from positive to negative, in this case clockwise in the circuit. 


peereereeneenesncene ene sseeverseeeeneewesteatereeeses, 


{ @) voltage source 
i AAW resistor 


$ 
$ 
$ 
$ 
i 
L SOO inductor 
$ 4 
i ~___ capacitor i 
Series RLC Circuit 


We will assume that the voltage source is an audio oscillator that pro- duces 
the voltage 
Equation: 


V(t) = A cos (wt + y). 

We represent this voltage as the complex signal 
Equation: 

V (t) & Ae’?e™ 
and give it the phasor representation 
Equation: 

V(t) O V;V = Ae”. 

We then describe the voltage source by the phasor V and remember that we 


can always compute the actual voltage by multiplying by e/“* and taking 
the real part: 


Equation: 
V (t) = Re {Ve™}. 
Exercise: 


Problem: Show that Re [Vem] = A cos (wt + y) when V = Ae”, 


Circuit Laws. In your circuits classes you will study the Kirchhoff laws 
that govern the low frequency behavior of circuits built from resistors (R), 
inductors (L), and capacitors (C’). In your study you will learn that the 
voltage dropped across a resistor is related to the current that flows through 
it by the equation 

Equation: 


Vr (t) = Ri(t). 


You will learn that the voltage dropped across an inductor is proportional to 
the derivative of the current that flows through it, and the voltage dropped 
across a Capacitor is proportional to the integral of the current that flows 
through it: 

Equation: 


di 
Vi (t) = L(t) 


Vo (t) = & fi(t)dt. 


Phasors and Complex Impedance. Now suppose that the current in the 
preceding equations is sinusoidal, of the form 
Equation: 


i(t) = B cos (wt + 8). 


We may rewrite i(t) as 


Equation: 


i(t) = Re {Ie} 


where I is the phasor representation of i(t). 
Exercise: 


Problem: Find the phasor J in terms of B and @ in [link]. 
The voltage dropped across the resistor is 
Equation: 
Vr(t) =  Ri(t) 
= RRe{Ie} 
= Re{Rle}. 


Thus the phasor representation for Vz (t) is 
Equation: 
Vr (t) > Vr; Ve = ial. 
We call R the impedance of the resistor because R is the scale constant that 
relates the “phasor voltage Vp' to the “phasor current I.” 


The voltage dropped across the inductor is 
Equation: 


V_ (¢) = L— =L, “Re {Ie}. 


The derivative may be moved through the Re| | operator (see [link]) to 
produce the result 
Equation: 


Vz (t) = DRe{jwle™} 
= Re{jwLIe™}. 


Thus the phasor representation of Vz (t) 
Equation: 
Vi, (t) => VL; Vie qWLI. 
We call jw the impedance of the inductor because 7wL is the complex 


scale constant that relates “phasor voltage Vz,' to “phasor current I.” 
Exercise: 


d 
Problem: Prove that the operators at and Re|} commute: 


Equation: 


pe {er} — Re ‘ set. 
dt dt 


The voltage dropped across the capacitor is 
Equation: 


Ve (t) = _ = 3 f Re {Ie \dt. 


The integral may be moved through the Re| | operator to produce the result 
Equation: 


Vet) = e{t eet | 


= Ref tem} 


Thus the phasor representation of Vo(t) is 
Equation: 


I 
Vo (t) @ Vo; Ve = = 
qwC 


is the complex 


° 


1 
We call oO the impedance of the capacitor because 
qw 


scale constant that relates “phasor voltage Vo" to “phasor current I.” 


Kirchhoff's Voltage Law. Kirchhoff's voltage law says that the voltage 
dropped in the series combination of R, L, and C illustrated in [link] equals 
the voltage generated by the source (this is one of two fundamental 
conservation laws in circuit theory, the other being a conservation law for 
current): 

Equation: 


V (t) = Vr (t) + Vz, (t) + Vo (t). 


If we replace all of these voltages by their complex representations, we 
have 
Equation: 


Re{Ve""} = Re{ (Ve +V,+ Vc) eh. 


An obvious solution is 
Equation: 


V= Vret+Vri,+Voe 
(R+ jwb+ po) 


where I is the phasor representation for the current that flows in the circuit. 
This solution is illustrated in [link], where the phasor voltages RI, jwLI, 


and =, are forced to add up to the phasor voltage V. 
qwC’ 
V 
qoOLL RI 


Phasor Addition to Satisfy Kirchhoff's Law 


Exercise: 


Problem: Redraw [link] for R = wl = = I. 


Ww 
1 
Impedance. We call the complex number R + jwL + oO the complex 
qw 


impedance for the series RLC network because it is the complex number 
that relates the phasor voltage V to the phasor current I: 
Equation: 


V=Z1 


Z=R+ jul + oe. 


The complex number Z depends on the numerical values of resistance (R), 
inductance (LZ), and capacitance (C), but it also depends on the angular 
frequency (w) used for the sinusoidal source. This impedance may be 
manipulated as follows to put it into an illuminating form: 

Equation: 


Z 


R+ j(wL — =a) 
R+ iy & (wvEC- ce). 
1 


The parameter wp = Tee. is a parameter that you will learn to call an 


"undamped natural frequency" in your more advanced circuits courses. 
With it, we may write the impedance as 
Equation: 


The frequency aa is a normalized frequency that we denote by v. Then the 


impedence, as a function of normalized frequency, is 
Equation: 


AiR et (v- o) 


V 


When the normalized frequency equals one (v = 1), then the impedance is 
entirely real and Z = R. The circuit looks like it is a single resistor. 
Equation: 


The impedance obeys the following symmetries around v = 1: 
Equation: 


Z(v) = 2" (2) 
av) = 2(2) 


arg Z(v) = —arg Z(+). 
In the next paragraph we show how this impedance function influences the 
current that flows in the circuit. 


Resonance. The phasor representation for the current that flows the current 
that flows in the series RLC circuit is 


Equation: 
= V 
ve Zo) 
— —_1 _ ,—jargZ(v) 
Zon © Me 
The function H (v) = re displays a "resonance phenomenon." that is, 


|H(v)| peaks at vy = 1 and decreases to zero and vy = 0 and v = co: 
Equation: 


y=0 
|H (v)| = = y=1 


When |H(v)| = 0, no current flows. 


w 
The function |H(v)| is plotted against the normalized frequency v =— in 
Ww 


Figure 3.14. The resonance peak occurs at y = 1, where | H(v)| =— 


meaning that the circuit looks purely resistive. Resonance phenomena 
underlie the frequency selectivity of all electrical and mechanical networks. 


| 1 (v)l versus v 


&le 


Resonance in a Series RLC Circuit 


Exercise: 


Problem: 


(MATLAB) Write a MATLAB program to compute and plot | H(v)| 
and arg H(v) versus v for v ranging from 0.1 to 10 in steps of 0.1. 


L 
Carry out your computations for wo R= 10,1, 0.1, and 0.01, and 


overplot your results. 


Circle Criterion and Power Factor. Our study of the impedance Z(v) and 


1 
Z(v) 
circuit and illustrates the ffequency selectivity of the circuit. But there is 
more that we can do to illuminate the behavior of the circuit. 

Equation: 


the function H (v) = brings insight into the resonance of an RLC 


1 
V => RPA wb = L. 
+i(w =) 


This equation shows how voltage is divided between resistor voltage RI and 
inductor-capacitor voltage 7 (wL — =z)I : 


Equation: 


V = RI + juoL (= 2 ma 


Wo Ww 


or 
Equation: 


V = Rie Jeon (v- ) RI. 


In order to simplify our notation, we can write this equation as 
Equation: 


V =Vert+jk(v)VR 


where Vp is the phasor voltage RI and k(v) is the real variable 


Equation: 
_ woL 1 
K(v) =? ( =). 


[link] brings very important geometrical insights. First, even though the 
phasor voltage Vz in the RLC circuit is complex, the terms Vr and 


jk (v)Vp are out of phase by 5 radians. This means that, for every 
allowable value of Vp, the corresponding jk (v)Vz must add in a right 
triangle to produce the source voltage V. This is illustrated in [link]. As the 
frequency v changes, then k(v) changes, producing other values of Vr and 
jk (v) Vp that sum to V. Several such solutions for Vp and jk (v)Vp are 
illustrated in Figure 3.15(b). From the figure we gain the clear impression 


V 
that the phasor voltage Vs lies on a circle of radius - centered at > 


Let's try this solution, 


Equation: 


and explore its consequences. When this solution is substituted into [link], 


the result is 


Equation: 
V ’ V 
Ve ace + e)”) + jk(v) (1 + el”) 
or 
Equation: 

2= (1+e”) (1+ jk(v)]. 

Vv 

GEV) Vp 

VR 


The Components of V; (a) Addition of Vp and jk (v)Vp 
to Produce V, and (b) Several Values of Vp and jk (v)Vp 
that Produce V 


If we multiply the left-hand side by its complex conjugate and the right- 
hand side by its complex conjugate, we obtain the identity 


Equation: 


4 = 2(1+ cos p)[1+ k? (v)]. 


This equation tells us how the angle 7 depends on k(v) and, conversely, 
how k(v) depends on @?: 


Equation: 
1— k? (v) 
cos y= 
v 1+ k? (v) 
Equation: 
PU) 1— cos 
1+ cos w 


The number cos w lies between —1 and +1, so a circular solution does 
indeed work. 
Exercise: 


Problem: 


Check —1 <cos w < 1 for —coo < k < co and —co < k < ow for 
—7n7 <<. Sketch k versus w and w versus k. 


V . 
The equation Vp = oF (1 — el) is illustrated in [link]. The angle that Vp 


makes with V is determined from the equation 
Equation: 


2yta-parsy=* 


The Voltages V and Vp, and the Power Factor 
cos /Y 


In the study of power systems, cos y is a "power factor" that determines 
how much power is delivered to the resistor. We may denote the power 
factor as 

Equation: 


7] =CO8 (P =COS -, 


But cos ~ may be written as 
Equation: 


=cos y =cos — 
1) ? 9 


But cos ~ may be written as 
Equation: 


cosy =cos(y+y) = cos? yp——sin 7p 
cos? y — (1- cos” y) 


- 2 cos? y—1 
2n? — 1. 


Therefore the square of the power factor 7 is 
Equation: 
.  cosp+1 _ 1 
rr, mere ey 2 


The power factor is a maximum of 1 for k(v) = 0, corresponding to v = 1 
(w = wo). It is a minimum of 0 for k(v) = +co, corresponding to 

vy = 0, 0o(w = 0,00). 

Exercise: 


Problem: 


L 1 
With k defined as k(v) = — (» _ =) , plot k? (v), cos w, and 7” 


versus V. 


Exercise: 


Problem: Find the value of v that makes the power factor 7 = 0. 707. 


Phasors: Light Scattering by a Slit 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


One of the most spectacular successes for phasor analysis arises in the study 
of light diffraction by a narrow slit. The experiment is to shine laser light 
through a slit in an otherwise opaque sheet and observe the pattern of light 
that falls on a distant screen. When the slit is very narrow compared with 
the wavelength of the light, then the light that falls on the screen is nearly 
uniform in intensity. However, when the width of the slit is comparable to 
the wavelength of the light, the pattern of light that falls on the screen is 
scalloped in intensity, showing alternating light and dark bands. The 
experiment, and the observed results, are illustrated in [link]. 
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Light Diffraction by a Slit 


Why should this experiment produce this result? Phasor analysis illuminates 
the question and produces an elegant mathematical description of a 
profoundly important optical experiment. 


Huygens's Principle. We will assume, as Christiaan Huygens did, that the 
light incident on the slit sets up a light field in the slit that may be modeled 
by WN discrete sources, each of which radiates a “spherical wave of light.” 
This model is illustrated in [link]. The distance between sources is d, and 
Nd = Lis the width of the slit. Each source is indexed by n, and n runs 
from 0 to N — 1. The 0 source is located at the origin of our coordinate 
system. 


The spherical wave radiated by the n* source is described by the equation 
Equation: 


A. 
E(rmt) =Re {per errr \ 


The function E(r,,, t) describes the “electric field” at time t and distance r, 
from the n* source. The field is constant as long as the variable 

wt — (27/A)r, is constant. Therefore, if we freeze time at t = to, the field 
will be constant on a sphere of radius 7,,. This is illustrated in [link]. 


Fe ae ae 
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Huygens's Model for Light Diffraction 


Exercise: 
Problem: 
Fix r, in E(r,, t) and show that E(r,, t) varies cosinusoidally with 
time ¢. Sketch the function and interpret it. What is its period? 
Exercise: 
Problem: 
Fix t in E(r,, t) and show that E(r,,, t) varies cosinusoidally with 


radius r,,. Sketch the function and interpret it. Call the “period in r,," 
the “wavelength.” Show that the wavelength is A. 


Exercise: 
Problem: 


The “crest of the wave E(r,,, t)" occurs when wt — (27/A)rn = 0. 
Show that the crest moves through space at velocity v = wA/2r. 


Geometry. If we now pick a point P on a distant screen, that point will be 
at distance rg from source 0, ..., 7, from source n, ..., and so on. If we 


isolate the sources 0 and n, then we have the geometric picture of [link]. 
The angle vy is the angle that point P makes with the horizontal axis. The 
Pythagorean theorem says that the connection between distances 79 and r, 
is 

Equation: 


rn —nd sin y)? + nd cos y)? = r?. 
0 
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Geometry of the Experiment 


Let's try the solution 
Equation: 


fn =To + nd sin ~. 


This solution produces the approximate identity 
Equation: 


r + (nd cos y)? = re 


Equation: 
r2 + (nd cos eyes r? 
2 


ee ( 24 cos y) > 1. 


TO 


d 
This will be close for = << 1. We will assume that the slit width L is 
To 
small compared to the distance to any point on the screen. Then 


Nd L 
—— = — << 1, in which case the approximate solution for r,, is valid 
ro ro 


for all n. This means that, for any point P on the distant screen, the light 
contributed by the n* source is approximately 
Equation: 


Ey, (y, t) — Re { F eflet—2n/)(rotndsing)] } 
= Re{ Fe Fm A)r0@—J2n/A)ndsing e jut 


The phasor representation for this function is just 
Equation: 


En (y) — Ze sa rig ate ation, 


Note that Eo (y), the phasor associated with the 0“ source, is 
a (27/A)ro_ Therefore we may write the phasor representation for the 


light contributed by the n*® source to be 
Equation: 


E,, (9) = Ey (p)e FOr naaing, 


This result is very important because it shows the light arriving at point P 


from different sources to be “out of phase” by an amount that depends on 
_ ndsin y 
the ratio 


Phasors and Interference. The phasor representation for the field observed 
at point P on the screen is the sum of the phasors contributed by each 
source: 


Equation: 
N-1 N-1 . 
E(y) = ) | En(y) = Bo(y) > e FOr nine 
n— n=0 


This is a sum of the form 
Equation: 


where the angle @ is (27/A)d sin y. This sum is illustrated in [link] for 
several representative values of 8. Note that for small 6, meaning small y, 
the sum has large magnitude, whereas for 0 on the order of 27/N, the sum 
is small. This simple geometric interpretation shows that for some values of 
(, Corresponding to some points P on the screen, there will be constructive 
interference between the phasors, while for other values of y there will be 
destructive interference. Constructive interference produces bright light, 
and destructive interference produces darkness. 


Phasor Sums for Diffraction 


The geometry of [link] is illuminating. However, we already know from our 
study of complex numbers and geometric sums that the phasor sum of 
Equation 3.85 may be written as 

Equation: 


fl = e—J(2n/A)Ndsiny 


E(¢) = Eo (¥)——sayame 


This result may be manipulated to produce the form 


Equation: 
. d 7 
E(y) = A ston) drog=iln/M(N-Lyasing 80 (Nr sin 9) 
N sin (x4 sin y) 


The magnitude is the intensity of the light at angle @ from horizontal: 
Equation: 


A sin (Nxt sin y) 


IE (¢)| =| I. 


N sin (nt sin y) 
Exercise: 
Problem: Derive [link] from [link]. 


Limiting Form. Huygens's model is exact when d shrinks to 0 and N 
increases to infinity in such a way that Nd — L, the slit width. Then 
Equation: 


L 
This function is plotted in [link] for two values of — the width of the slit 
L DN 
measured in wavelengths. When =i << 1i(ie., TL >> 1), then the light is 


L a 
uniformly distributed on the screen. However, when c > | (+ < 1) : 


then the function has many zeros for | sin y| < 1, as illustrated in the 
figure. These zeros correspond to dark spots on the screen where the fields 
radiated from the infinity of points within the slit interfere destructively. 


Interference Pattern for a Slit 


Exercise: 


Problem: Derive [link] from [link]. 


Exercise: 


Problem: (MATLAB) Plot the discrete approximation 
Equation: 
4 sin (Nré sin y) 


sin (r# sin y) 


L Nd 
versus sin y for iar Se 10 and N = 2,4, 8, 16, 32. Compare 


with the continuous, limiting form 
Equation: 


: Te 2s 
A sin (7+ sin y) 
L . 
my sin p 
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Let's add two sinusoids whose amplitudes and frequencies are identical and 
whose phases are different: 
Equation: 


z(t) = A cos (wt+ y) + Acos (wt + y+ y). 


Show analytically that this sum has the phasor representation 
Equation: 


OA ae (5) eile (¥/2)) 
9 


Interpret this finding. Then write a MATLAB program that computes and 
plots complex X on the complex plane as W varies from 0 to 27 and that 
plots magnitude, |X|, and phase, arg X, versus the phase angle w. (You 


2 
will have to choose w = nn = 0,1,...,.N —1, for a suitable N.) 


When do you get constructive interfelence and when do you get destructive 
interference? Now compute and plot x(t) versus t (you will need to 
discretize t) for several interesting values of #. Explain your interference 
results in terms of the amplitude and phase of x(t) and the magnitude and 
phase of X. Use the subplots discussed in "An Introduction to MATLAB" 
to plot all of your results together. 


Linear Algebra: Introduction 
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We use this chapter to introduce students to the algebraic structure of 
vectors and matrices and to introduce them to matrix computations. These 
matrix computations are used in the chapters "Vector Graphics", "Filtering", 
and "Binary Codes" to solve problems in vector graphics, filtering, and 
binary coding. 


Vectors are introduced in the section "Vectors", along with algebraic and 
geometric interpretations of some fundamental vector operations and 
properties. The section "Inner Product and Euclidean Norm", the section 
"Direction Cosines", and the section "Projections" introduce inner products 
and their applications, including norm, direction cosines, orthogonality, and 
projections. Some important alternatives to the Euclidean norm are 
introduced in the section "Other Norms". Matrices are motivated and 
introduced in this section. The notation in these sections can be daunting to 
the beginner, so we proceed very carefully, using example after example. In 
the section "Solving Linear Systems of Equations" we codify the 
elimination procedures that students have used in high school to solve linear 
systems of equations. The MATLAB demonstration in Demo 2 shows how 
to use MATLAB to solve linear equations. The section "Circuit Analysis" 
shows how linear algebra and MATLAB can be used to analyze dc circuits. 
The numerical experiment "Circuit Design" gives students practice in 
building function files in MATLAB and shows how to solve a sequence of 
linear equations in order to design a circuit with desired properties. 


Occasionally we have placed important results in the problems. We feel that 
students should not miss the material in Exercise 3 in "Vectors", Exercise 3 


in "Inner Product and Euclidean Norm", Exercise 3 in "Projections", 
Exercise 1 in "Matrices", and Exercise 4 in "Matrices". 


Introduction 


Linear algebra is a branch of mathematics that is used by engineers and 
applied scientists to design and analyze complex systems. Civil engineers 
use linear algebra to design and analyze load-bearing structures such as 
bridges. Mechanical engineers use linear algebra to design and analyze 
Suspension systems, and electrical engineers use it to design and analyze 
electrical circuits. Electrical, biomedical, and aerospace engineers use linear 
algebra to enhance X rays, tomographs, and images from space. In this 
chapter and the next we study two common problems from electrical 
engineering and use linear algebra to solve them. The two problems are (i) 
electrical circuit analysis and (ii) coordinate transformations for computer 
graphics. The first of these applications requires us to understand the 
solution of linear systems of equations, and the second requires us to 
understand the representation of mathematical operators with matrices. 


Much of linear algebra is concerned with systematic techniques for 
organizing and solving simultaneous linear equations by elimination and 
substitution. The following example illustrates the basic ideas that we 
intend to develop. 


Example: 

A woman steps onto a moving sidewalk at a large airport and stands while 
the moving sidewalk moves her forward at 1.2 meters/seconds. At the 
same time, a man begins walking against the motion of the sidewalk from 
the opposite end at 1.5 meters/second (relative to the sidewalk). If the 
moving sidewalk is 85 meters long, how far does each person travel 
(relative to the ground) before they pass each other? 

To solve this problem, we first assign a variable to each unknown quantity. 
Let x, be the distance traveled by the woman, and let x2 be the distance 
traveled by the man. The sum of the two distances is 85 meters, giving us 
one equation: 


Equation: 
21+ 22 = 85. 


Our second equation is based on the time required before they pass. Time 
equals distance divided by rate, and the time is the same for both people: 
Equation: 

L1 r2 


i SS I Le, = UL, 
12 io eZ 


2 
We may substitute [link] into [link] to obtain the result 03" + x2 = 85, 


or 
Equation: 


i oie — ee —— I 


Combining the result from [link] with that of [link], we find that 
Equation: 


Ly = 68. 
So the man travels 17 meters, and the woman travels 68 meters. 


[link] and [link] are the key equations of [link]. They may be organized 
into the “matrix equation” 


Equation: 
1 1 LI _ 895 
Oe2) al) ee | BO | 


The rules for matrix-vector multiplication are evidently 
Equation: 


(1) 21+ (1)r2 = 85 
Equation: 


(0. 3)x4 a (—1. 2)x2 = (; 


[link] and [link] may be organized into the matrix equation 


Equation: 
0 +) Zi} | 85 
0.3 -1.2||zo} [0] 


This equation represents one partially solved form of [link], wherein we 
have used the so-called Gauss elimination procedure to introduce a zero 
into the matrix equation in order to isolate one variable. The MATLAB 
software contains built-in procedures to implement Gauss elimination on 
much larger matrices. Thus MATLAB may be used to solve large systems 
of linear equations. 

Before we can apply linear algebra to more interesting physical problems, 
we need to introduce the mathematical tools we will use. 


Linear Algebra: Vectors 
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For our purposes, a vector is a collection of real numbers in a one- 
dimensional array.[footnote] We usually think of the array as being 
arranged in a column and write 

In a formal development of linear algebra, the abstract concept of a vector 
space plays a fundamental role. We will leave such concepts to a complete 
course in linear algebra and introduce only the functional techniques 
necessary to solve the problems at hand. 


In 


Notice that we indicate a vector with boldface and the constituent elements 
with subscripts. A real number by itself is called a scalar, in distinction 
from a vector or a matrix. We say that x is an n-vector, meaning that x has 
n elements. To indicate that x1 is a real number, we write 

Equation: 


Cie 


meaning that x; is contained in __, the set of real numbers. To indicate that 
x is a vector of n real numbers, we write 


Equation: 


n 


x€ ; 


meaning that x is contained in R”, the set of real n-tuples. Geometrically, 
R"is n-dimensional space, and the notation x € ™ means that z is a point 
in that space, specified by the n coordinates 21, £9, ..., Xn. [link] shows a 
vector in R, drawn as an arrow from the origin to the point z. Our 
geometric intuition begins to fail above three dimensions, but the linear 
algebra is completely general. 


eee ene nee eee we a = 


A Vector in R? 


We sometimes find it useful to sketch vectors with more than three 
dimensions in the same way as the three-dimensional vector of [link]. We 
then consider each axis to represent more than one dimension, a hyperplane, 
in our n-dimensional space. We cannot show all the details of what is 
happening in n-space on a three-dimensional figure, but we can often show 
important features and gain geometrical insight. 


Vector Addition. Vectors with the same number of elements can be added 
and subtracted in a very natural way: 


Equation: 
L1+ Yi L1— Y1 
LQ + Yo L2— Y2 
LEry= +y ;€-Y= 3 - YB 
| | 
En + Yn La = Uy 
Example: 
1 0 
The difference between the vectorx = 1 andthevectory= 0 _ isthe 
1 ai 
1 
vector z= x2—y= 1 . These vectors are illustrated in [link]. You can 
0 


see that this result is consistent with the definition of vector subtraction in 
[link]. You can also picture the subtraction in [link] by mentally reversing 
the direction of vector y to get —y and then adding it to z by sliding it to 
the position where its tail coincides with the head of vector x. (The head is 
the end with the arrow.) When you slide a vector to a new position for 
adding to another vector, you must not change its length or direction. 


Subtraction of Vectors 


Exercise: 


Problem: 


Compute and plot x + y and x — y for each of the following cases: 


1 1 
a= eye De *S 
2 3 
=1 1 
bem 3: . 4 2 
=2 3 
1 1 
C= eo. y=] Bo. 
2 2 


Scalar Product. Several different kinds of vector multiplication are 
defined.[ footnote] We begin with the scalar product. Scalar multiplication 
is defined for scalar a and vector x as 

The division of two vectors is undefined, although three different 
,Atidivisions, Au are defined in MATLAB. 

Equation: 


ary 
ar? 


ax= arg 


aAXn 


If |a| < 1, then the vector ax is "shorter" than the vector x; if |a| > 1, then 
the vector ax is ,"longer" than x. This is illustrated for a 2-vector in [Link]. 


Do 


~2X 


The Scalar Product ax 


Exercise: 


Problem: 


1 
Compute and plot the scalar product ax whenx = 1/2 for each of 
1/4 
the following scalars: 
re) emo 
b. a= —1; 
ca=—-1/4; 
dva-= 2. 
Exercise: 
Problem: 
Given vectors , , © ™andthescalara € _, prove the following 
identities: 


a.x + y = y + x. Is vector addition commutative? 

b. (x + y) +z2=x+ (y +2). Is vector addition associative? 

c. a(x + y) = ax + ay. Is scalar multiplication distributive over 
vector addition? 
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The inner product (x, y) between vectors x and y is a scalar consisting of 
the following sum of products: 
Equation: 


(x, y) = ©1Y1 + Toy2 + T3y3 +°°+ + TnYn- 


This definition seems so arbitrary that we wonder what uses it could 
possibly have. We will show that the inner product has three main uses: 


i. computing length or “norm”, 
ii. finding angles between vectors and checking for “orthogonality”, and 
iii. computing the “component of one vector along another” (projection). 


Since the inner product is so useful, we need to know what algebraic 
operations are permitted when we are working with inner products. The 
following are some properties of the inner product. Given x, y,z € @” and 


ac &, 
i. (x,y) = (y, x); 
te (axsy ) a(x, y) ; 
iii. (x, y + 2) = (x,y) + (x2). 


Exercise: 


Problem: 


Prove the three preceding properties by using the definition of inner 
product. Is the equation x(y, z) = (x, y)z also a true property? Prove 
or give a counterexample. 


Euclidean Norm. Sometimes we want to measure the length of a vector, 
namely, the distance from the origin to the point specified by the vector's 
coordinates. A vector's length is called the norm of the vector. Recall from 
Euclidean geometry that the distance between two points is the square root 
of the sum of the squares of the distances in each dimension. Since we are 
measuring from the origin, this implies that the norm of the vector x is 


Equation: 
|| x |= 4/0? + 2 + +22. 


Notice the use of the double vertical bars to indicate the norm. An 
equivalent definition of the norm, and of the norm squared, can be given in 
terms of the inner product: 


Equation: 
Il x |= + x) 
or 
Equation: 
2 
| x ||" = (x, x) 
Example: 


The Euclidean norm of the vector 
Equation: 


is || x ||= 4/12 + 32 + 52 + (—2)? = V39. 


An important property of the norm and scalar product is that, for any 
x€ GF andacGZ, 
Equation: 


||ax|| = Ja/| |x|]. 


So we can take a scalar multiplier outside of the norm if we take its absolute 
value. 
Exercise: 


Problem: Prove [link]. 


Cauchy-Schwarz Inequality. Inequalities can be useful engineering tools. 
They can often be used to find the best possible performance of a system, 
thereby telling you when to quit trying to make improvements (or proving 
to your boss that it can't be done any better). The most fundamental 
inequality in linear algebra is the Cauchy-Schwarz inequality. This 
inequality says that the inner product between two vectors z and y is less 
than or equal (in absolute value) to the norm of zx times the norm of y, with 
equality if and only if y = ax: 

Equation: 


I(x, y)] S |IxI[lIyIl. 


To prove this theorem, we construct a third vector z = Ax — y and measure 
its norm squared: 
Equation: 


J|Ax — y||? = (Ax — y, Ax — y) = A ||x||? — 2A (x, y) + |lyl|? = 0. 


So we have a polynomial in A that is always greater than or equal to 0 
(because every norm squared is greater than or equal to 0). Let's assume 
that x and y are given and minimize this norm squared with respect to A. To 
do this, we take the derivative with respect to A and equate it to 0: 
Equation: 


(x,y) 


2 
> 


2A||x||? — 2 (x,y) =0>A= 


When this solution is substituted into the formula for the norm squared in 
[link], we obtain 
Equation: 


(x, y) ‘ll? = 2(x, y) 


2 
2 2 (x,y) + |lyl| = 0, 
|><]| |>]| 


which simplifies to 
Equation: 


(x,y)° 


i 2 
|x| 


+ |ly||? > 0 = (x,y)? < |[x|[*ly||?. 


The proof of the Cauchy-Schwarz inequality is completed by taking the 
positive square root on both sides of [link]. When y = ax, then 
Equation: 


(x,y)? = (c,ax)? = [lal (x,2))? = (lallxl?)? 


= (ee|"|IxI")|[<1|” 


] 
e 
ia 
Q 
sey 
ie 


(y, y)||x 
= Iles lee 


which shows that equality holds in [link] when y is a scalar multiple of z. 
Exercise: 


Problem: 
Use the Cauchy-Schwarz inequality to prove the triangle inequality, 
which states 


Equation: 


Ix + yl] < [xl] + Ilyll- 


Explain why this is called the triangle inequality. 
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Unit Vectors. Corresponding to every vector z is a unit vector uz pointing 
in the same direction as x. The term unit vector means that the norm of the 


vector is 1: 
Equation: 


j|ux || = 1. 


The question is, given x, how can we find u,? The first part of the answer 
is that uw, will have to be a positive scalar multiple of x in order to point in 
the same direction as x, as shown in [link]. Thus 

Equation: 


Ux = OX. 


The Unit Vector u, 


But what is a? We need to choose a so that the norm of wu, will be 1: 
Equation: 


j|Ux || = 1 
Equation: 

jax|| = 1 
Equation: 

jo|||x|| = 1 
Equation: 

1 
© UbeI| 


We have dropped the absolute value bars on a@ because ||x|| is positive. The 
a that does the job is 1 over the norm of x. Now we can write formulas for 
Uy, in terms of x and z in terms of uz : 

Equation: 


Equation: 


x= |fx||s. 


So the vector z is its direction vector u,, scaled by its Euclidean norm. 


Unit Coordinate Vectors. There is a special set of unit vectors called the 
unit coordinate vectors. The unit coordinate vectore; is a unit vector in V” 


that points in the positive direction of the i** coordinate axis. [link] shows 
the three unit coordinate vectors in V°®. 


Ls 


Unit Coordinate Vectors in R? 
For three-dimensional space, R?, the unit coordinate vectors are 


Equation: 


More generally, in n-dimensional space, there are n unit coordinate vectors 
given by 


cn) 


e;= |1{ — i*® element 


a) 


0 «— n elements. 


You should satisfy yourself that this definition results in vectors with a 
norm of I. 


Exercise: 


Problem: Find the norm of the vector au where wu is a unit vector. 


Exercise: 


Problem: Find the unit vector wu, in the direction of 


3 
bese > 6 ' 

—2 

1 

—l 
i 

1 


Direction Cosines. We often say that vectors “have magnitude and 
direction.” This is more or less obvious from "Linear Algebra: Vectors", 


where the three-dimensional vector x has length ,/ x? + «3 + #3 and 


points in a direction defined by the components x1, x2, and x3. It is 
perfectly obvious from [link] where x is written as x = ||x||u,. But perhaps 
there is another representation for a vector that places the notion of 
magnitude and direction in even clearer evidence. 


[link] shows an arbitrary vector x € &? and the three-dimensional unit 
coordinate vectorse,, €2, €3. The inner product between the vector x and 
the unit vector e; just reads out the k** component of z: 

Equation: 


(x, ex) = (ex, x) = Lk. 
Since this is true even in R”, any vector x € &” has the following 


representation in terms of unit vectors: 
Equation: 


x = (x,e1)e, + (x, e2)eg + --- + (x, en)en- 


The Three-Dimensional Unit Vectors 


Let us now generalize our notion of an angle 0 between two vectors to R” 
as follows: 
Equation: 


Oy) 


cos 9 = ———_ 
Tesiibal 


The celebrated Cauchy-Schwarz inequality establishes that —1— < cos 
8 < 1. With this definition of angle, we may define the angle 6; that a 
vector makes with the unit vector e; to be 


Equation: 
x,e 
IIx] |[ex || 
But the norm of e; is 1, so 
Equation: 
(x, ex) Lk 
cos 0, = — 
||| Ix|| 


When this result is substituted into the representation of x in [link], we 
obtain the formula 
Equation: 


x = ||x|| cos 01e1 + ||x|| cos Age. + --- + ||x|| cos Onen 
||x||(cos 03e;+ cos Oge2 + ---+ cos One). 


This formula really shows that the vector x has “magnitude” ||x|| and 
direction (01, 62, ...,9,) and that the magnitude and direction are sufficient 
to determine x. We call ( cos 61, cos 09, ..., cos 9,) the direction cosines for 
the vector z. In the three-dimensional case, they are illustrated in [link]. 


Exercise: 


Problem: 


Show that [link] agrees with the usual definition of an angle in two 
dimensions. 


Exercise: 


Problem: Find the cosine of the angle between x and y where 


1 2 
ace ye 2 : 

0 2 

1 —1 

—1 0 
x= 1 v= 1 : 

—1 0 

2 4 
cz] bt ya 2 

—2 —4 


If we compare [link] and [link] we see that the direction vector wz is 
composed of direction cosines: 


Equation: 
cos0; 
cosO> 
u,, =cos 8,;e;+ cos O,e) + ---+ cos 6,e, = 
cos6,, 


With this definition we can write [link] compactly as 


Here z is written as the product of its magnitude ||x|| and its direction 
vector u,. Now we can give an easy procedure to find a vector's direction 
angles: 


’ 


(i) find ||x 


x 
(ii) calculate u, = ——; and 


|x 


(iii) take the arc cosines of the elements of wz. 


Step (iii) is often unnecessary; we are usually more interested in the 
direction vector (unit vector) u,. Direction vectors are used in materials 
science in order to study the orientation of crystal lattices and in 
electromagnetic field theory to characterize the direction of propagation for 
radar and microwaves. You will find them of inestimable value in your 
courses on electromagnetic fields and antenna design. 

Exercise: 


Problem: 


Sketch an arbitrary unit vector u € #°?. Label the direction cosines 
and the components of u. S 


Exercise: 
Problem: 
4 
Compute the norm and the direction cosines for the vectorx = 2 
6 
Exercise: 
Problem: 


Prove that the direction cosines for any vector satisfy the equality 
Equation: 


cos 7 6;+ cos” 0. +---+ cos’ 6, = 1. 


What does this imply about the number of scalars needed to determine 
avectorx € #"? 


Exercise: 


Problem: 


Astronomers use right ascension, declination, and distance to locate 
T 
stars. On [link] these are, respectively, —y, ie 63, and ||x]]. 


Represent x = (#1, £2, x3) in terms of y, 63, and ||x|| only. (That is, 
find equations that give y, 03, and ||x|| in terms of 71, x2, and x3, and 
find equations that give x1, x2, and x3 in terms of , 63, and ||x]|. ) 


Exercise: 
Problem: 
(MATLAB) Write a MATLAB function that accepts any vector 


x € @” and computes its norm and direction cosines. Make x an input 
variable, and make ||x|| and u, output variables. 


Exercise: 
Problem: 
Let x and y denote two vectors in R” with respective direction cosines 
(cos 01, cos 2, ...,cos 8,) and ( cos 1, COS $2, ..., COS Yn). Prove 


that a, the angle between x and y, is 
Equation: 


cos w=cos 4; cos yi+ cos 42 cos y2+---+cosO, COS Yn. 


Specialize this result to R? for insight. 


Exercise: 


Problem: 


2 2 
Compute the angle between the vectorsx = 3 and 3 _.Sketchthe 
0 1 


result. 


Linear Algebra: Projections 


Note:This module is part of the collection, A First Course in Electrical and 
Computer Engineering. The LaTeX source files for this collection were 
created using an optical character recognition technology, and because of 
this process there may be more errors than usual. Please contact us if you 
discover any errors. 


Orthogonality. When the angle between two vectors is 7/2 (90°), we say 
that the vectors are orthogonal. A quick look at the definition of angle 
(Equation 12 from "Linear Algebra: Direction Cosines") leads to this 
equivalent definition for orthogonality: 

Equation: 


(x,y) =0<-x and y are orthogonal. 


3 —2 
For example, in Figure 1(a), the vectors x = H and y = ; | are clearly 


orthogonal, and their inner product is zero: 


Equation: 
(x,y) = 3(—2) + 1(6) =0. 
3 —2 0 
In Figure 1(b), the vectorsx= 1 ,y= 6 ,andz= O are 
0 0 4 


mutually orthogonal, and the inner product between each pair is zero: 
Equation: 


(x, y) = 3(—2) + 1(6) + 0(0) = 0 


Equation: 


(x,z) = 3(0) + 1(0) + 0(4) =0 


Equation: 


Orthogonality of Vectors 


Exercise: 


Problem: Which of the following pairs of vectors is orthogonal: 


1 0 
ac] ). y= 1 3 
1 0 
1 1 
bx 1 oye iL. a 
0 1 


We can use the inner product to find the projection of one vector onto 
another as illustrated in Figure 2. Geometrically we find the projection of x 


onto y by dropping a perpendicular from the head of x onto the line 
containing y. The perpendicular is the dashed line in the figure. The point 
where the perpendicular intersects y (or an extension of y) is the projection 
of x onto y, or the component of x along y. Let's call it z. 


Component of One Vector along 
Another 


Exercise: 


Problem: 


Draw a figure like Figure 2 showing the projection of y onto x. 


The vector z lies along y, so we may write it as the product of its norm ||z]| 
and its direction vector wu, : 
Equation: 
bs 
z= ||z||uy = ||2||——. 
° IIyl| 


But what is norm ||z||? From Figure 2 we see that the vector z is just z, plus 
a vector v that is orthogonal to y: 
Equation: 


x=2z+v, (v,y) =0. 


Therefore we may write the inner product between x and y as 
Equation: 


(x,y) =(z+Vv,y) = (z,y) + (v,y) = (2, y). 


But because z and y both lie along y, we may write the inner product (x, y) 
as 


Equation: 
(x,y) = (zy) = (lalluy, lylluy ) = lal [ly] Cay, uy) 
2 
= lly hall" = [el{ily 
_ (x,y) . 
From this equation we may solve for ||z|| = ill and substitute ||z]| into 
ha 
Equation 6 to write z as 
Equation: 
Z = 257 
_ (&y) _y _ Gy) 
~ Ty Ty ~ Gx) 


Equation 10 is what we wanted—an expression for the projection of x onto y 
in terms of x and y. 
Exercise: 


Problem: 


Show that ||z|| and z may be written in terms of cos 0 for 0 as 
illustrated in Figure 2: 
Equation: 


[|z|| = ||x]] cos 0 
Equation: 


|x|] cos 6 


ial 


Orthogonal Decomposition. You already know how to decompose a vector 
in terms of the unit coordinate vectors, 
Equation: 


X = (x,e1)e1 + (x,e2)e2 +--+ + (x, en)en. 


In this equation, (x, e,)ez is the component of x along eg, or the projection 
of x onto ex, but the set of unit coordinate vectors is not the only possible 
basis for decomposing a vector. Let's consider an arbitrary pair of orthogonal 
vectors x and y: 

Equation: 


(x, y) = 0. 


The sum of x and y produces a new vector w, illustrated in Figure 3, where 
we have used a two-dimensional drawing to represent n dimensions. The 
norm squared of wi is 


Iwi]? = (w,w) = [(x+y), (x+y)] = (xx) + (xy) + (y,x) + (yy) 
= Ilx||? + lIyll’. 


This is the Pythagorean theorem in n dimensions! The length squared of w 
is just the sum of the squares of the lengths of its two orthogonal 
components. 


Sum of Orthogonal Vectors 


The projection of w onto x is x, and the projection of w onto y is y: 
Equation: 


w = (1)x+ (1)y. 


If we scale w by a to produce the vector z = aw, the orthogonal 
decomposition of z is 
Equation: 


Z = aw = (a)x + (a)y. 


Let's turn this argument around. Instead of building w from orthogonal 
vectors x and y, let's begin with arbitrary w and x and see whether we can 
compute an orthogonal decomposition. The projection of w onto x is found 
from Equation 10: 

Equation: 


But there must be another component of w such that w is equal to the sum of 
the components. Let's call the unknown component w,. Then 
Equation: 


W = Wx + Wy. 


Now, since we know w and wz already, we find wy to be 
Equation: 


Interestingly, the way we have decomposed w will always produce w, and 
Wsy orthogonal to each other. Let's check this: 


Equation: 
(WasWy) = Gay WH Ey 
= ew) - x) 
r= (w,x)" _ (w,x)’ 
(x,x) (x,x) 
= 0. 


To summarize, we have taken two arbitrary vectors, w and x, and 


decomposed w into a component in the direction of x and a component 
orthogonal to z. 


Linear Algebra: Other Norms 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Sometimes we find it useful to use a different definition of distance, 
corresponding to an alternate norm for vectors. For example, consider the I- 
norm defined as 

Equation: 


xl] = (le [+] [+--+ +n), 


where |z,| is the magnitude of component x;. There is also the sup-norm, 
the “supremum” or maximum of the components 71,...,%n: 
Equation: 


a leas —max (\x1 |, |x F neey |X |). 


The following examples illustrate what the Euclidean norm, the l-norm, and 
the sup-norm look like for typical vectors. 


Example: 


Consider the vectorx = 1 . Then 


i ell = [(-3)° +)? + @)"] = 04)"; 
ill, = (| — 31 + [1] + 21) = 65 ana 


iii. |[X|| up =naax (| — 3], [1], [2]) = 3. 

Example: 
£ 

Figure 1 shows the locus of two-component vectors x = ] with the 
so) 

property that ||x|| = 1, iesie = 1, a [Psilene =. 

locus of X such a locus of x such 

that |IXilsup=! that |Ixl|l,= 1 


eg 


locus of x such 
that xl, = | 


Locus of Two-Dimensional Vectors Whose 
Various Norms Are 1 


The next example shows how the |-norm is an important part of city life. 


Example: 

The city of Metroville was laid out by mathematicians as shown in Figure 
2. A person at the intersection of Avenue 0 and Street —2 (point A) is 
clearly two blocks from the center of town (point C). This is consistent 
with both the Euclidean norm 

Equation: 


|All = 02 + (-2)? = V4 =2 


and the l-norm 
Equation: 


Al], = (0] + | — 2[) = 2. 


But how far from the center of town is point B at the intersection of 
Avenue-2 and Street 1? According to the Euclidean norm, the distance is 


Equation: 
||Bl| = y (-2)° + @)° = v5. 


Street -2 
Street -1 
Street 0O 
Street 2 


Avenue 2 


Z > 


Metroville, U.S.A. 


While it is true that point B is 5 blocks from C,, it is also clear that the 
trip would be three blocks by any of the three shortest routes on roads. The 
appropriate norm is the l-norm: 

Equation: 


271], = (1-21 + [1)) = 3. 


Even more generally, we can define a norm for each value of p from 1 to 
infinity. The so-called p-norm is 
Equation: 
p Pp 
[Tx||, = (arf +la2f +-+-+|an|?)"””. 


Exercise: 
Problem: 
Show that the Euclidean norm is the same as the p-norm with p = 2 


and that the 1-norm is the p-norm with p = 1. (It can also be shown 
that the sup-norm is like a p-norm with p = oo.) 


DEMO 4.1 (MATLAB). From the command level of MATLAB, type the 
following lines: 


>> xX = [1;3;-2;4] 
>> y = [0;1;2;-0.5] 
>> X - y 


Check to see whether the answer agrees with the definition of vector 
subtraction. Now type 


>> a= -1.5 
woe 8 oX 


Check the answer to see whether it agrees with the definition of scalar 
multiplication. Now type 


>> Xx ' * y 
This is how MATLAB does the inner product. Check the result. Type 


>> norm(y) 
>> sqrt(y' * y) 


>> norm(y,1) 
>> norm(y' * y) 


Now type your own MATLAB expression to find the cosine of the angle 
between vectors x and y. Put the result in variable t. Then find the angle 0 


by typing 
>> theta = acos(t) 


The angle @ is in radians. You may convert it to degrees if you wish by 
multiplying it by 180/z: 


>> theta = theta * (180/p1) 


Linear Algebra: Matrices 


Note:This module is part of the collection, A First Course in Electrical and Computer Engineering. The LaTeX 
source files for this collection were created using an optical character recognition technology, and because of this 
process there may be more errors than usual. Please contact us if you discover any errors. 


The word matrix dates at least to the thirteenth century, when it was used to describe the rectangular copper tray, or 
matrix, that held individual leaden letters that were packed into the matrix to form a page of composed text. Each 
letter in the matrix, call it a;;, occupied a unique position (27) in the matrix. In modern day mathematical parlance, 
a matrix is a collection of numbers arranged in a two-dimensional array (a rectangle). We indicate a matrix with a 
boldfaced capital letter and the constituent elements with double subscripts for the row and column: 


@11 4120 130 e+ Ih 

G21 G22 @23 «++ Gan + second row 
A= a31 a32 a33 enane. a3n 

Qm1 Gm2 Am3 +--+ Amn 


third column 


Matrix 


In this equation A is an m x n matrix, meaning that A has m horizontal rows and n vertical columns. As an 
extension of the previously used notation, we write A © #™*" to show that A is a matrix of size m x n with 
aij € &. The scalar element a,; is located in the matrix at the i** row and the jh column. For example, a3 is 
located in the second row and the third column as illustrated in [link]. 


The main diagonal of any matrix consists of the elements a,;;. (The two subscripts are equal.) The main diagonal 
runs southeast from the top left corner of the matrix, but it does not end in the lower right commer unless the matrix 
is square (€ Z™*™), 


The transpose of a matrix A € #™*" is another matrix B whose element in row j and column 7 is bj; = aj; for 
1<i<mand1l<j< 7. We write B = A’ to indicate that B is the transpose of A. In MATLAB, transpose is 
denoted by A'. A more intuitive way of describing the transpose operation is to say that it flips the matrix about its 
main diagonal so that rows become columns and columns become rows. 

Exercise: 


Problem: If A ¢ #™*”, then A? €?. Find the transpose of the matrix A = 


Nob 
Co Fk eR 


Matrix Addition and Scalar Multiplication. Two matrices of the same size (in both dimensions) may be added 
or subtracted in the same way as vectors, by adding or subtracting the corresponding elements. The equation 

C = A+B means that for each i and j, ¢;; = a;; + b,;. Scalar multiplication of a matrix multiplies each element 
of the matrix by the scalar: 

Equation: 


QXi1, aAxr{2 axi13 se ALin 


QX%9, AX22 a3 ws ALIn 
aX = ax3, ar3. a%33 ... AL3n 
ALm) ALm2 ALm3Z ... ALmn 


Matrix Multiplication. A vector can be considered a matrix with only one column. Thus we intentionally blur the 


distinction between #”™! and R". Also a matrix can be viewed as a collection of vectors, each column of the 
matrix being a vector: 


Equation: 


= a a .. a, = 


In the transpose operation, columns become rows and vice versa. The transpose of ann x 1 matrix, a column 
vector, isa 1 xn matrix, a row vector: 


Equation: 


Ty 
LQ 
x= xP = [v1222p]. 


In 


Now we can define matrix-matrix multiplication in terms of inner products of vectors. Let's begin with matrices 


AEB” andB € &””. To find the product AB, first divide each matrix into column vectors and row vectors 
as follows: 


[— at —] 


A= al a2 vane an = aa az ae 


[— a, —] 


Thus a; is the i** column of A and a7 is the j" row of A. For matrix multiplication to be defined, the width of the 
first matrix must match the length of the second one so that all rows a? and columns b; have the same number of 


elements n. The matrix product, C = AB, is anm x p matrix defined by its elements as c,; = (ai, by); In words, 


each element of the product matrix, c;;, is the inner product of the i** row of the first matrix and the j*” column of 
the second matrix. 


For n-vectors x and y, the matrix product x’ y takes ona special significance. The product is, of course, a1 x 1 
matrix (a scalar). The special significance is that x7y is the inner product of x and y: 


P= ox =] 


y | = (x,y). 


Thus the notation x7y is often used in place of (x, y). Recall from Demo 1 from "Linear Algebra: Other Norms" 
that MATLAB uses Xx ' *y to denote inner product. 


Another special case of matrix multiplication is the outer product. Like the inner product, it involves two vectors, 
but this time the result is a matrix: 


| T TiY1 = Liy2 ***  L1Yn 
— yr — 
T2Y1 @2y2 "'* Fey 
A= |x = : : 
| 
ZmY1 TmY2 **' LmYn 


In the outer product, the inner products that define its elements are between one-dimensional row vectors of x and 
one-dimensional column vectors of y”, meaning the (i, 7) element of A is xjy;. 
Exercise: 


Problem: Find C = AB where A and B are given by 


a i <2 2 <8 
aA=|) | B= 42 a0 
Dt 4 
1 0 1234 
ba=|j Bela alt 
a | 03 6 
cA= 1-11 ,B=147 
11 1 258 


There are several other equivalent ways to define matrix multiplication, and a careful study of the following 
discussion should improve your understanding of matrix multiplication. Consider A € #”™*",B € Z"*?, and 
C = ABso that C € #™”*?. In pictures, we have 

Equation: 


In our definition, we represent C' on an entry-by-entry basis as 
Equation: 


n 
Cij = (a;,b;) = eo tik Oks 


In pictures, 
Cij [— ai 


You will prove in Exercise 3 that we can also represent C’ on a column basis: 
Equation: 


n 
c; = d azbdz;. 
IT YS kj 


Finally, C' can be represented as a sum of matrices, each matrix being an outer product: 
Equation: 


C= Sapr 


A numerical example should help clarify these three methods. 


Example: 
Let 
Equation: 
i 2 i § os 
Ae ZBilBa, v= ee 
- 3 8 2 i — 
a it il 


Using the first method of matrix multiplication, on an entry-by-entry basis, we have 
Equation: 


4 
Cay = oe Bik Ok 


or 
Equation: 


(1-142-241-143-2) (1-24+2-2+1-3+43-1) (1-142-1+1.- 3-1) 
C= (2-141-242-144.-2) (2-24+1-242-344-1) (2-141-142-2441) 
(3-143-24+2-141-2) (3-24+3-2+2-3+41-1) (3-14+3-1+2.- 1-1) 
or 
Equation: 
I WA 
C= ia 16 iil , 
13 19 Il 
On a column basis, 
Equation: 
3 ayb 
Ce ee ROS 
1 2 1 1 2 3 1 2 1 
= 2 alse i Aste 4 ise 4 Bey 4 At i AS Bae 4 Ieey= A ise I ila 2 
3 3 2 1 3 3 1 3 3 2 
Collecting terms together, we have 
Equation: 
C = [c1c2c3| 
(1-142-241-143.-2) (1-24+2-2+4+1-3+3-1) (1-1+2-1+4+1-2+4+3-1) 
C= (2-141-24+2-144-2) (2-24+1-242-344-1) (2-141-142-2441) 
(3-143-242-141-2) (3-24+3-2+4+2-3+1-1) (3-1+3 2-2+1-1) 
On a matrix-by-matrix basis, 
Equation: 
4 
C= daisy 
Equation: 
1 2 1 3 
G= 7% i Ral ae il Bee ae se iL ee SE bh SE a at al 
3 3 2 1 
Equation: 
1-1 1-2 1-1 wt WoW MoM koll ileg iow 3:2 3-1 3-1 
= Boll BoA Bell + ilew jley ilo =e Bell 32-2 + 4-2 4-1 4-1 
soll go4 sjoll a2 2 3-2) 3-1 mel Bos wow ho. itell ile 
Equation: 
(1-142-241-143-2) (1-2+4+2.-2 -343-1) (1-142-141-2+43-1) 
C= (2-141-242-144-2) (2-24+1-242-344.-1) (2-141-142-2441) 
(3-143.- 2-1+1-2) (83-2+3-2+4+2-3+41-1) (8-1+3-14+2-2+41-1) 


as we had in each of the other cases. Thus we see that the methods are equivalent-simply different ways of 
organizing the same computation! 


Exercise: 
Problem: 
Prove that Equations 9, 11, and 13 are equivalent definitions of matrix multiplication. That is, if C = AB 


where A € #™*”" and B € &"*?, show that the matrix-matrix product can also be defined by 
Equation: 


n 
Ce y aizdn;, 
fel 


and, if cy is the k** column of C and ay, is the k™ column of A, then 
Equation: 


n 
Cj = ) apdg;. 
k=1 


Show that the matrix C' may also be written as the , "sum of outer products" 
Equation: 


C= S arGh- 
k=1 


Write out the elements in a typical outer product a, Oe 


Exercise: 


Problem: 


Given A € #”"*" Be &*4, and C € Z"**, for each of the following postulates, either prove that it is true 
or give a counterexample showing that it is false: 


a (AT)" =A. 

b. AB = BA when n = pand m = q. Is matrix multiplication commutative? 

c. A(B+C) = AB+ AC when n = p= rand q = s. Is matrix multiplication distributive over 
addition? 

d. (AB a = BTAT whenn = p. 

e. (AB)C = A(BC) when n = p and q = T. Is matrix multiplication associative? 


Example: 

Rotation 

We know from the chapter on complex numbers that a complex number z; = x; + jy; may be rotated by angle 0 
in the complex plane by forming the product 

Equation: 


zo =e" z,. 


When written out, the real and imaginary parts of z2 are 
Equation: 


a (cos 6+ j sin 8) (x1 + jy1) 
(cos #)x; — (sin 6)y1 + j[(sin @)x1 + (cos 6)y3]. 


If the real and imaginary parts of z; and Z2 are organized into vectors z; and Z2 as in the chapter on complex 
numbers, then rotation may be carried out with the matrix-vector multiply 


Equation: 
L cosO —sin@] |x, 
Z2 — = a i 
Y2 sin? cos0 Yi 


cos? —sin@ 


We call the matrix R (#) = | a rotation matrix. 


sin@ cos0 


Exercise: 


Problem: Let R(@) denote a 2 x 2 rotation matrix. Prove and interpret the following two properties: 
a R? (0) = R(—6); 


b. R7 (6)R (6) = R(6)R7 (8) = F i} 


Linear Algebra: Solving Linear Systems of Equations 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


We are now equipped to set up systems of linear equations as matrix- vector 
equations so that they can be solved in a standard way on a computer. 
Suppose we want to solve the equations from Example 1 from "Linear 
Algebra: Introduction" for 2; and x2 using a computer. Recall that 
Equations 1 and 2 from Linear Algebra: Introduction are 

Equation: 


21 +22 = 85 
NS es: Ss 
1.2 ~— 1.5-1.2 


The first step is to arrange each equation with all references to x, in the 
first column, all references to x» in the second column, etc., and all 
constants on the right-hand side: 

Equation: 


21 +22 = 85 
0.32, —1.2z%. = 0. 


Then the equations can be converted to a single matrix-vector equation. The 
coefficients form a matrix, keeping their same relative positions, and the 
variables and constants each form a vector: 

Equation: 


Exercise: 


Problem: 


Verify by the rules of matrix multiplication that the system of 
equations in 2 in equivalent to the matrix equation in 3. 


Equation 3 is of the general form 
Equation: 


> 
8 
| 
ion 


where in this case 
Equation: 


Given any A € @”*”" and b € &”, MATLAB can solve Equation 4 for x 
(as long as a solution exists). Key ideas in the solution process are the 
identity matrix and the inverse of a matrix. 


When the matrix A is the 1 x 1 matrix a, the vector z is the l-vector x, and 
the vector b is the l-vector 6b, then the matrix equation Ax = b becomes the 
scalar equation 

Equation: 


ax = b. 
The scalar a! is the inverse of the scalar a, so we may multiply on both 


sides of Equation 6 to obtain the result 
Equation: 


a '(ax) =a~1b 


le =a~‘b. 


We would like to generalize this simple idea to the matrix equation Ax = b 
so that we can find an inverse of the matrix A, called A~‘, and write 
Equation: 


A} (Az) — A-!ph 
Ix = A~'b. 


In this equation the matrix J is the identity matrix 


Equation: 
1 0 0 0 
01 0 0 
1- 9901 0 
0 0 O 1 


It is clear that the identity matrix J and the inverse of a matrix, A~?, play a 
key role in the solution of linear equations. Let's study them in more detail. 


The Matrix Identity. When we multiply a scalar by 1, we get back that 
same scalar. For this reason, the number 1 is called the 
multiplicativeidentity element. This may seem trivial, but the generalization 
to matrices is more interesting. The question is, is there a matrix that, when 
it multiplies another matrix, does not change the other matrix? The answer 
is yes. The matrix is called the identity matrix and is denoted by J. The 
identity matrix is always square, with whatever size is appropriate for the 
matrix multiplication. The identity matrix has I's on its main diagonal and 
0's everywhere else. For example, 

Equation: 


1 00 0 0 
010 0 0 
Is= 00 1 0 0 
000 1 0 
00001 


The subscript 5 indicates the size. In terms of the unit coordinate vectors e;, 
we can also write the n x n identity matrix as 
Equation: 


For any matrix A € Z””?, we have 
Equation: 


A=1I,A. 


For p = 1, we obtain the following special form for any vector x € &” : 
Equation: 
x= 1x, 


This last equation can be written as the sum 
Equation: 


n 
x= ) Ljey. 
i=1 


This is a special case of one of the results you proved in Exercise 3 from 
"Linear Algebra: Matrices". Figure 1 illustrates Equation 14 and shows how 


the vector x can be broken into component vectors x;e;, each of which lies 
in the direction of one coordinate axis. The values of the xs are the 
coordinates of z, and their magnitudes are also the lengths of the 
component vectors. 


Breaking a Vector into Components 


Exercise: 
Problem: 
Use Equation 13 and the rules for matrix multiplication to show that 


x € &” may also be written as 
Equation: 


3 


This verifies Equation 11 from "Linear Algebra: Direction Cosines". 


When the product of two numbers is 1 (the identity element), we say that 
they are inverses of each other, like 2 and 0.5. Likewise, we say that two 
square matrices are inverses of each other if their product is the identity 
matrix: 


Equation: 


AB=ISB=A". 


An interesting and useful result is implied by this definition. Take the first 
form of Equation 16 and multiply by B from the left: 


Equation: 

AB =I 
= B(AB)=BI 
=> (BA)B=B 
=> BA =I. 


We emphasize that the inference made in the last step here is only valid 
when B and A are square matrices. The result means that, even though 
matrix multiplication is not commutative in general, we have a special case 
where it is commutative. If A and B are inverses of each other, then 
Equation: 


AB=BA=lI. 


Exercise: 


Problem: 


Prove that the inverse of the 2 x2 rotation matrix R(@) is R7 (6). 


Matrices that are not square do not have inverses. In fact, not all square 
matrices have inverses. So it becomes an important issue to determine 
which matrices do have inverses. If a matrix has an inverse, the inverse is 
unique. If a square matrix has no inverse, it is called a singular matrix. The 
determinant of a square matrix is a scalar computed from the numbers in 
the matrix. It tells us whether a matrix will have an inverse or not. A matrix 
is singular if and only if its determinant is zero.[footnote] In MATLAB, the 
notation det (A) is used to compute the determinant. Whenever the matrix 
A in Equation 4 is singular, it means one of two things about the system of 


equations represented: either the equations are inconsistent and there is no 
solution, or the equations are dependent and there are infinitely many 
solutions. 

It is not important now to understand how the determinant is defined and 
computed from the elements of a matrix. In your more advanced courses 
you will study the determinant in some detail. 


Solving Ax = b. Let's now study ways to solve the matrix equation 

Ax = b. We will assume that a unique solution for x exists. Thus a unique 
matrix A~! exists with the property A~'A = I. The trick is to find it. Here 
is one procedure. 


For convenience, rewrite the matrix equation Ax = b as 
Equation: 
x 
Ab = 0. 
=1 


The matrix | Ab] € #™("+1) is called the augmented matrix for the system 


of equations. The augmented matrix may be viewed as a systematic way of 
writing all the information necessary to solve the equations. 


The advantage of Equation 19 is that we may premultiply both sides by any 
matrix C’; (of compatible size), and the right-hand side remains zero 
(although this is equivalent to multiplying on both sides of Equation 4, 
which some may prefer). We can repeat this operation as often as we like 
with matrices Co, C3, etc. The general result is 

Equation: 


[Cm itis CoC, AC,, ee CC b] | = (j, 


Now suppose that we have found a sequence of matrices C,... , C, that 
transforms the matrix A to the identity matrix: 


Cm-++:C2C,A =I. The augmented matrix equation in 20 simplifies to 


Equation: 


x 
1em---cxc][*,] <0 


which can be multiplied out to reveal that the solution for x is 
Equation: 


x= Cm-+: CoC 1b. 


The method may be summarized as follows: 


i. form the augmented matrix [Ab] for the system; 
ii. premultiply [Ab] by a sequence of selected matrices C’;, designed to 
take A to J; and 
iii. when A is transformed to J, b will be transformed to z, so the solution 
will appear as the last column of the transformed augmented matrix. 


We may also conclude that the product of the matrices C; must be the 
inverse of A since A+ is the unique matrix for which A-'A = I. In 
solving for x by this method, we have found A~! implicitly as the product 


Cm: CoCr. 


Example: 
Consider the equation 
Equation: 


Equation: 
7p cat ae| oy 


The augmented matrix for this equation is 


Equation: 


Now if we could add —2/3 times the first row to the second row, we 
would get 0 in the lower left corner. This is the first step in transforming A 
to the identity I. We can accomplish this row operation with the matrix 


Equation: 
1 
C= : 
—2/3 1 
Equation: 
Sl 
C, [Ab] = als 
0 10/3 1 


Now adding —3/10 times the new second row to the first row will 
introduce 0 in the (1, 2) position, bringing us closer still to the identity. 
Thus 

Equation: 


aoe : — 


Equation: 


C0, [Ab] = 0 das 


0 10/3 1 


We now complete the transformation to identity by normalizing each row 
to get the needed I's: 
Equation: 


= ee aa 


Equation: 


Cy0,C, [Ab] = ; 0 al 


0 1 3/10 


According to the last column, the solution is 


Equation: 
_ [19/10 
= lave | 


We note in passing that the inverse of A is the product of the C's, so 


BES ORCICT = ha mn ki ial Exe | 
_ ae uh | i - ‘| _ ie Pre 


The method we have just used, combined with a particular way of choosing 
the C’; matrices, is called Gauss elimination. Gauss elimination requires less 
computation than finding the inverse of A because only the effect of A~! 
on the specific vector b is computed. MATLAB can solve for x by either 
method, as shown in Demo 4.2. For hand computations, we suggest 
choosing the C’; matrices so that C’; produces O’s everywhere below the 
diagonal in the first column, C’2 produces O's below the diagonal in the 
second column, and so on up to C,,_1. Then C’, produces O's above the 
diagonal in the n*) column, C41 produces O's above the diagonal in 
column n — 1, etc. The last one, C5,,_;, normalizes the diagonal elements 
to l's. We have assumed for simplicity that no O's on the diagonal will be 
encountered in hand computations. 

Exercise: 


Problem: Check that A~'A = I in Example 1 


Exercise: 


Problem: 


Augment Equation 3 as in Equation 19 and use the technique of Gauss 
elimination to solve for z. 


Demo 1 (MATLAB). From the command level of MATLAB, solve the 
matrix equation of Example 1 from "Linear Algebra: Introduction" by 


typing 


>> A 
>>. 5 


[1 1;0.3 -1.2] 
[85;0] 


You have entered the matrices A and b, which describe the problem. You 
can now solve for x by finding the inverse of A and multiplying b: 


>> X = inv(A) * b 


In this example the inverse is computed quickly because A is a small 
matrix. For a large (say, 30 x 30) matrix, the answer would take longer to 
compute, and the more efficient method of Gauss elimination would reduce 
waiting time. You can use Gauss elimination in MATLAB by typing 


>> XS A\ D 


You should get the same answer. Now type the MATLAB code required to 
compute Ax and to show Ax — b = 0. 
Exercise: 


Problem: 


(MATLAB) Write the following system of equations as Ax = b and 
solve using MATLAB: 
Equation: 


3 (21 — 23) +2(a.-1)-6=23 
Equation: 


473 = 72x92 —5 


Equation: 


6 (a, +42+2)=234+1. 
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In this section we use the linear algebra we have developed to find the 
voltages and currents in a simple electrical circuit, such as the one shown in 
Figure 1. There are many reasons why this might be necessary; in this 
example we need to know the current flowing through the lamp to tell 
whether or not it will glow. Too little current will have no visible effect; too 
much current will cause the lamp to burn out. We will apply a few physical 
laws relating the voltages and currents in a circuit, turn these laws into 
systems of linear equations, and solve the equations for the voltages and 
currents. 


SO 100 


-7@ 300 2 (lamp) 


A Simple Electrical Circuit 


Current, Voltage, and Resistance. We will use three physical quantities in 
our analysis of electrical circuits: current, voltage, and resistance. Current is 
the flow of electrical charge from one place to another. Electrons flowing 
through a wire or through some other electronic device comprise a current. 
Voltage is a difference in electric potential that makes electrons flow. 


Voltage is sometimes called electromotive force because it is like a “force” 
that moves electrons. Resistance is a property of the device through which 
the electron current flows. The lower the resistance of a device, the more 
easily current can flow through the device. 


The analogy of water flowing through pipes can help you develop intuition 
about electrical circuits. In this analogy, electrical current corresponds to the 
flow rate of water. Voltage corresponds to the pressure that forces the water 
to flow, and resistance is the friction of flow. A small pipe would impede 
the flow of water more than would a large pipe, so the small pipe would 
correspond to a higher resistance. While this analogy can be helpful, keep 
in mind that electricity is not water. All analogies break down at some 
point. 


We measure electrical current in amperes. The standard symbol for current 
is 2, and the direction of positive flow is indicated by an arrow on the circuit 
diagram. The arrow is for reference only; if the true current is in the 
opposite direction, we get negative values for 2. Because electrons are 
negatively charged, current is defined as flowing in the opposite direction as 
electron motion. But to reduce confusion, you should learn to think in terms 
of current rather than electron motion. 


A point in a circuit where several devices are connected together is called a 
node. The conservation law for current says that “what flows in must flow 
out of a node,” a principle known as Kirchhoff's current law. Kirchhoff's 
current law states that the sum of all currents leaving a node is zero. In this 
law, a current entering the node is considered to be a negative current 
leaving the node. 


Voltage is measured in volts and is usually written as v (or e). Since voltage 
is a difference in potential between two points (nodes), we can show it on a 
circuit diagram with a +-and a —sign to indicate which two nodes we are 
comparing and which one of the nodes is considered negative. As with 
current, the markings are for reference only and we may end up with a 
negative value of v. 


In an electrical circuit, one node is usually chosen as a reference node and is 
considered to have a voltage of zero. Then the voltage at every other node is 


measured with respect to the reference node. This saves us the trouble of 
always specifying pairs of nodes for voltage measurements and marking + 
and — signs for each voltage. Other names for the reference node are 
common and ground. 


A constant voltage source is a device that always forces the voltage 
between its two terminals to be a constant value. In Figure 1 the circle at the 
left represents a constant voltage source of 5 volts, so that the voltage at the 
upper (+) end is always exactly 5 volts higher than the voltage at the lower 
(-) end. A voltage source is something like a battery, but idealized. Real 
batteries do not maintain a constant output voltage under all conditions. 


Resistance is measured in ohms and is denoted by R. A resistor is shown as 
a zig-zag line in circuit diagrams and labeled with the value of its resistance 
in ohms. In this chapter we will consider only devices whose resistance is 
positive and the same in both directions. Ohm's law, also called the resistor 
law, relates the voltage and current in a resistor. For the resistor shown in 
Figure 2, with reference directions assigned to v andz as shown, Ohm's law 
is 

Equation: 


v —iR. 


Note that current flows from + to - through the resistor. 


Ohm's Law 


Example: 

Ohm's law and Kirchhoff's current law are the only principles we need to 
write equations that will allow us to find the voltages and currents in the 
resistive circuit of Figure 1. We begin by choosing a reference node and 
assigning variables to the voltages at every other node (with respect to the 
reference node). These choices are shown in Figure 3. 
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Assigning Node Voltages 


The constant voltage source forces v; to be exactly 5 volts higher than the 
reference node. Thus 
Equation: 


Dy = Sh. 


Next we write equations by applying Kirchhoff's current law to each node 
in the circuit (except the reference node and v1, whose voltages we already 
know). At the node labeled v2 are three paths for leaving current. The 
current leaving through the 50 ohm resistor can be found by Ohm's law, 
where the voltage across that resistor is v2 — v1 : 

Equation: 


et eae) 
50 


For current leaving through the 300 ohm resistor, the voltage is v2. Pay 
careful attention to the sign; since we are interested in the current leaving 
the node labeled v2, Figure 4.14 indicates that to apply Ohm's law we 
should take the voltage as +v2— reference = v2 — 0 = v2. So 


Equation: 


p V2 
1300 = —— 
300 300 
For the 100 ohm resistor, we can write 
Equation: 
gon a My 
100 100 


According to Kirchhoff's current law, the sum of these three leaving 
currents is zero: 
Equation: 


(v2—1) v (v2—v3)  __ 

0 ~~ + 300 + 199 = 9 
= 6 (v2 V1) + v2 + 3 (v2 — v3) = 0 
=> —6v; + 10v2 — 3v3 = 0. 


Notice that when we wrote the equation for the node labeled v2, the 
variable v2 had a + sign each time it occurred in the equation, while the 
others had a -sign. This is always the case, and watching for it can help you 
avoid sign errors. Now we apply Kirchhoff's current law at the node 
labeled v3 to get the equation 
Equation: 
(v3—v2) V3, 
100 Soy =U 
= (v3 = U2) + 50v3 = 0 
=> Ovi —1ve + 51lv3 = 0. 


Note that this time it is vg that always shows up with a + sign. 
Equations 2, 6, and 7 give us a system of three equations in the three 
unknown variables vj, v2, and v3. We now write them in matrix form as 
Equation: 


1 0 0 V] 
—6§ 10 —3 V0 = 
0 —l1 51 V3 


Go oO UO 


Exercise: 


Problem: 


(MATLAB) Use MATLAB to solve Equation 8. You should find 
Equation: 


v1 = 5.0000 volts 
Equation: 

vo — 3.0178 volts 
Equation: 


v3 = 0.0592 volt. 


What is the determinant of the coefficient matrix A? Is the solution 
unique? 


We can determine the current flowing through the lamp from v3 to ground 
in Example 1 by Ohm's law: 
Equation: 


= = = 0.0296 ampere. 


Vv 
R 


The visible effect will, of course, depend on the lamp. Let us assume that 
the specifications for our lamp indicate that 0.05 ampere or more is required 


before it will glow, and more than 0.075 ampere will cause it to burn out. In 
this case, our circuit would not make the lamp glow. 
Exercise: 


Problem: 
(MATLAB) Write and solve equations for the circuit in Figure 4. What 


are the voltages at the nodes labeled v; through v4? What is the 
current labeled 2; ? And 29? 


reference node 


A Resistive Network 
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Our analysis in Example 1 from "Linear Algebra: Circuit Analysis" and 
Problem 1 from "Linear Algebra: Circuit Analysis” indicates that not 
enough current will flow through the lamp to make it glow. We now wish to 
change the resistance of the 100 ohm resistor to a new value so that the 
lamp will glow. We replace 100 in the equations with an unknown 
resistance R. Equation 2 is unchanged, but Equation 6 becomes 

Equation: 


(v2—v1) v (v2—v3) __ 
rr + 399 + a: =0 


=> 6R(v2— v1) + Rve + 300 (v2 — v3) = 0 
=> —6Rv, + (7R + 300)v2 — 300v3 = 0. 


Equation 7 from "Linear Algebra: Circuit Analysis" becomes 
Equation: 


(v3—v2) V3 
(sw) 4 19 — 9 
=> 2 (v3 — v2) + Rv3 = 0 
=> Ov; — 2v2 + (R +4 2)u3 = 0. 


The matrix form of these equations is 
Equation: 


1 0 0 V1 4) 
—6R 7R +300 —-300 we = 0. 
0 =2 R+2 vs 0 


Write a MATLAB function file called buUilda to accept R as an input and 
return the matrix A in Equation 3 as an output. The first line of your 
function file should be 


Function A = builda(R); 


Now choose several values for R. For each choice, use your function 
builda and solve the resulting matrix equation Av = b for the voltages. 
Each time you choose a different R to build a different matrix A, check the 
determinant of A to be sure the equations have a unique solution: 


>> det(A) 


Make a table of R and the corresponding values of v3: 


R U3 


Now add a column to your table for the current through the lamp 7 = v3/2. 
Add rows to your table until you have found a value of R for which the 
lamp will glow. (7 needs to be between 0.05 and 0.075 ampere.) 
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Notes to Teachers and Students: 


In this chapter we introduce matrix data structures that may be used to 
represent two- and three-dimensional images. The demonstration program 
shows students how to create a function file for creating images from these 
data structures. We then show how to use matrix transformations for 
translating, scaling, and rotating images. Projections are used to project 
three-dimensional images onto two-dimensional planes placed at arbitrary 
locations. It is precisely such projections that we use to get perspective 
drawings on a two-dimensional surface of three-dimensional objects. The 
numerical experiment encourages students to manipulate a star field and 
view it from several points in space. 


Once again we consider certain problems essential to the chapter 
development. For this chapter be sure not to miss the following exercises: 
Exercise 2 in "Two-Dimensional Image Transformations", Exercise 1 in 
"Homogeneous Coordinates", Exercise 2 in "Homogeneous Coordinates", 


Exercise 5 in "Three-Dimensional Homogeneous Coordinates", and 
Exercise 2 in "Projections". 


Introduction 


Pictures play a vital role in human communication, in robotic 
manufacturing, and in digital imaging. In a typical application of digital 
imaging, a CCD camera records a digital picture frame that is read into the 
memory of a digital computer. The digital computer then manipulates this 
frame (or array) of data in order to crop, enlarge or reduce, enhance or 
smooth, translateor rotate the original picture. These procedures are called 
digital picture processing or computer graphics. When a sequence of 
picture frames is processed and displayed at video frame rates (30 frames 
per second), then we have an animated picture. 


In this chapter we use the linear algebra we developed in The chapter on 
Linear Algebra to develop a rudimentary set of tools for doing computer 
graphics on line drawings. We begin with an example: the rotation of a 
single point in the (, y) plane. 


Example: 
Point P has coordinates (3, 1) in the (a, y) plane as shown in Figure 1. 


Find the coordinates of the point P/, which is rotated 3 radians from P. 


Rotating a Single Point in the (a, y) Plane 


To solve this problem, we can begin by converting the point P from 
rectangular coordinates to polar coordinates. We have 
Equation: 
r= /2?+y? = V10 
9 =tan 1 (+) = 0.3217 radian. 


x 


The rotated point P/ has the same radius r, and its angle is 8 + = We 


now convert back to rectangular coordinates to find x/ and y/ for point PY: 
Equation: 


a'=rcos (0+ 7) © V10 cos (0.8453) = 2.10 
y =r sin (0+ Z) = V'10 sin (0. 8453) ~ 2. 37. 


So the rotated point P/ has coordinates (2.10, 2.37). 


Now imagine trying to rotate the graphical image of some complex object 
like an airplane. You could try to rotate all 10,000 (or so) points in the same 
way as the single point was just rotated. However, a much easier way to 
rotate all the points together is provided by linear algebra. In fact, with a 
single linear algebraic operation we can rotate and scale an entire object and 
project it from three dimensions to two for display on a flat screen or sheet 
of paper. 


In this chapter we study vector graphics, a linear algebraic method of 
storing and manipulating computer images. Vector graphics is especially 
suited to moving, rotating, and scaling (enlarging and reducing) images and 
objects within images. Cropping is often necessary too, although it is a little 
more difficult with vector graphics. Vector graphics also allows us to store 
objects in three dimensions and then view the objects from various 
locations in space by using projections. 


In vector graphics, pictures are drawn from straight lines.[footnote] A curve 
can be approximated as closely as desired by a series of short, straight lines. 
Clearly some pictures are better suited to representation by straight lines 
than are others. For example, we can achieve a fairly good representation of 
a building or an airplane in vector graphics, while a photograph of a forest 
would be extremely difficult to convert to straight lines. Many computer- 
aided design (CAD) programs use vector graphics to manipulate 
mechanical drawings. 

It is possible to extend these techniques to deal with some types of curves, 
but we will consider only straight lines for the sake of simplicity. 


When the time comes to actually display a vector graphics image, it may be 
necessary to alter the representation to match the display device. Personal 
computer display screens are divided into thousands of tiny rectangles 
called picture elements, or pixels. Each pixel is either off (black) or on 
(perhaps with variable intensity and/or color). With a CRT display, the 
electron beam scans the rows of pixels in a raster pattern. To draw a line on 
a pixel display device, we must first convert the line into a list of pixels to 
be illuminated. Dot matrix and laser printers are also pixel display devices, 
while pen plotters and a few specialized CRT devices can display vector 


graphics directly. We will let MATLAB do the conversion to pixels and 
automatically handle cropping when necessary. 


We begin our study of vector graphics by representing each point in an 
image by a vector. These vectors are arranged side-by-side into a matrix G 
containing all the points in the image. Other matrices will be used as 
operators to perform the desired transformations on the image points. For 
example, we will find a matrix R, which functions as a rotation: the matrix 
product RG represents a rotated version of the original image G. 


Vector Graphics: Two-Dimensional Image Representation 
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Point Matrix. To represent a straight-line image in computer memory, we 
must store a list of all the endpoints of the line segments that comprise the 
image. If the point P; = (2;, y;) is such an endpoint, we write it as the 
column vector 

Equation: 


Suppose there are n such endpoints in the entire image. Each point is 
included only once, even if several lines end at the same point. We can 
arrange the vectors P; into a point matrix: 


Equation: 
G= [P1p2P3. - - Pr] 
7 LQ £3 | 
Yi Yo Ys Yn 


We then store the point matrix G € #?*" as a two-dimensional array in 
computer memory. 


Example: 


Consider the list of points 


Equation: 
P= (0, 0) 
P, = (-1.5,5) 
Pea 3(45 2.3) 
Py = (4, -1). 


The corresponding point matrix is 
Equation: 


Line Matrix. The next thing we need to know is which pairs of points to 
connect with lines. To store this information for m lines, we will use a line 
matrix, H € Z?*™. The line matrix does not store line locations directly. 
Rather, it contains references to the points stored in G. To indicate a line 
between points p; and p;, we store the indices z and 7 as a pair. For the kth 
line in the image, we have the pair 


Equation: 
4 
hy = Bal 
J 


The order of 2 and 7 does not really matter since a line from p; to p; is the 
same as a line from p; to p;. Next we collect all the lines h, into a line 
matrix Hf: 

Equation: 


All the numbers in the line matrix H will be positive integers since they 
point to columns of G. To find the actual endpoints of a line, we look at 
columns 7 and 7 of the point matrix G. 


Example: 

To specify line segments connecting the four points of Example 1 into a 
quadrilateral, we use the line matrix 

Equation: 


2 4 
Ei : : 
23 4 1 


Alternatively, we can specify line segments to form a triangle from the first 


three points plus a line from P3 to P, : 
Equation: 


eZ 
H) = ; “|. 
23 1 4 


Figure 1 shows the points G connected first by Hy and then by Hg. 


Two Sets of Lines 


Demo 1 (MATLAB). Use your editor to enter the following MATLAB 
function file. Save it as vgraphl.m. 


function vgraphl (points, lines); 

% vgraphl (points, lines) plots the points as *'s 
and 

% connects the points with specified lines. The po 
ints 

% matrix should be 2xN, and the lines matrix shoul 
d be 2xM. 

% The field of view is preset to (-50,50) on both 
axes. 

% 

% Written by Richard T. Behrens, October 1989 

ie) 

% 


m=lLength(lines); % find the num 
ber of 
% lines. 
axis([-50 50 -50 50]) % set the axis 
scales 


axis('square' ) 
plot(points(1,:),points(2,:),'*') % plot the poi 
nts as * 


hold on % keep the poi 
nts... 
for 1=1:m % while plotti 
ng the 

% lines 


plot([points(1,lines(1,1)) points(1, lines(2,1) 


)],- 
[points(2,lines(2,lines(1,1)) points(2,lin 
es(2,1))],'-') 


end 


hold off 


After you have saved the function file, run MATLAB and type the 
following to enter the point and line matrices. (We begin with the 
transposes of the matrices to make them easier to enter.) 


>> G= [ 
0.6052 -0.4728; 
-0.4366 3.5555; 
-2.6644 7.9629; 
-7.2541 10.7547, 
-12.5091 11.5633; 
-12.5895 15.1372; 

-6.5602 13.7536; 
-31.2815 -7.7994; 
-38.8314 -9.9874; 
-44.0593 -1.1537; 
-38.8314 2.5453; 
-39.4017 9.4594; 
-39.3192 15.0932; 
-45.9561 23.4158] 


>> G = G' 


V 
V 
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At this point you should use MATLAB's “save” command to save these 
matrices to a disk file. Type 


>> save dippers 


After you have saved the matrices, use the function VGRAPH1 to draw the 
image by typing 


> vgraphi(G,H) 


The advantage of storing points and lines separately is that an object can be 
moved and scaled by operating only on the point matrix G. The line 
information in H remains the same since the same pairs of points are 
connected no matter where we put the points themselves. 


Surfaces and Objects. To describe a surface in three dimensions is a fairly 
complex task, especially if the surface is curved. For this reason, we will be 
satisfied with points and lines, sometimes visualizing flat surfaces based on 
the lines. On the other hand, it is a fairly simple matter to group the points 
and lines into distinct objects. We can define an object matrix K with one 
column for each object giving the ranges of points and lines associated with 
that object. Each column is defined as 

Equation: 


first point 


last point 
eS .. 

first line 

last line 


As with the line matrix H, the elements of K are integers. 


Example: 
Consider again Demo 1. We could group the points in G and the lines in H 
into two objects with the matrix 


Equation: 


OorAT He 
— 
or 


The first column of AK specifies that the first object (Ursa Minor) is made 
up of points 1 through 7 and lines 1 through 6, and the second column of 
K defines the second object (Ursa Major) as points 8 through 14 and lines 
7 through 12. 
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We now turn our attention to operating on the point matrix G to produce the 
desired transformations. We will consider rotation scaling and translation 
(moving) of objects. Rotation and scaling are done by matrix multiplication 
with a square transformation matrix A. If we call the transformed point 
matrix Grew, we have 

Equation: 


[Gnew] = [A] [G]. 


We call A a matrix operator because it “operates” on G through matrix 
multiplication. In contrast, translation must be done by matrix addition. 


In a later section you will see that it is advantageous to perform all 
operations by matrix operators and that we can modify our image 
representation to allow translation to be done with a matrix operator like 
rotation and scaling. We will call the modified representation homogeneous 
coordinates. 


Rotation. We saw in the chapter on linear algebra that the matrix that 
rotates points by an angle @ is 
Equation: 


A=R(6) = fae | 


sinO cos@ 


When applied to the point matrix G, this matrix operator rotates each point 
by the angle 0, regardless of the number of points. 


Example: 

We can use the rotation matrix to do the single point rotation of the 
example from "Vector Graphics: Introduction". We have a point matrix 
consisting of only the point (3, 1): 


Equation: 
a=/ 
1 


TT 
The necessary transformation matrix is R(@) with 6 = a Then the rotated 


point is given by 
Equation: 


Scaling. An object can be enlarged or reduced in each dimension inde- 
pendently. The matrix operator that scales an image by a factor of s, along 
the x-axis and s, along the y-axis is 

Equation: 


6. ee C a 


0 Sy 


Most often we take s, = s, to scale an image by the same amount in both 
dimensions. 
Exercise: 


Problem: 


Write out the following matrices. Simplify and give numerical answers 
to two decimal places: 


Exercise: 
Reflections 


Problem: What does S(—1, 1) do? S(1, —1)?S(—1, —1)? S(1, 1)? 
Exercise: 
Problem: 


0 -1.5 4 4 
Given G = and 0 = — find Grew =R(O)G. 
0 5 2.3 —1 a 


Give numerical answers to two decimal places. 
Exercise: 


Problem: 


Apply each of the transformations in Exercise 1 and 2 to the image 
Equation: 
1 122 1 2 4 
G= ‘Hh = : : 
1221 23 4 1 


Sketch the original image and each transformation of it. 


Translation. An object can be moved by adding a constant vector b to 


20 
every point in the object. For example, b = d| will move an object 20 


units to the right and 5 units down. We can write this in terms of the point 
matrix as 
Equation: 


Grew = G+b1? 


where 1 (read “the one-vector’”) is a vector of n I's: 
Equation: 


In MATLAB, 1 may be obtained by ones(n, 1). The outer product of b 
with 1 in Equation 7 simply serves to make n copies of b so that one copy 
can be added to each point in G. 


Vector Graphics: Composition of Transformations 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Often we will want to perform several operations on an object before we 


display the result. For example, suppose we want to rotate by = and reduce 


1 
to 3 size in each dimension: 


Equation: 


If there are n points in the matrix G, it will require 4n multiplications to 
perform each of these operations, for a total of 8n multiplications. 
However, we can save some multiplications by noting that 


Equation: 
tne =8(3.4) [a(Z)e] =A0 


where 
Equation: 


In other words, we take advantage of the fact that matrix multiplication is 
associative to combine S and R into a single operation A, which requires 
only 8 multiplications. Then we operate on G with A, which requires 4n 
multiplications. By “composing” the two operations, we have reduced the 
total from 8n to 4n + 8 multiplications. Furthermore, we can now build 
operators with complex actions by combining simple actions. 


Example: 


We can build an operator that stretches objects along a diagonal line by 
composing scaling and rotation. We must 


i. rotate the diagonal line to the x-axis with R(—6); 
ii. scale with S(s, 1); and 
iii. rotate back to the original orientation with R(6). 


Figure 1 shows a square being stretched along a 45° line. The composite 
operator that performs this directional stretching is 
Equation: 


A(6,s) = R(6)S(s, 1)R(—8) 
cos@ —sin#@| |s 0} |{cosé — sind 
~ fee cos0 | ; | end a 
pane +sin?9 (s— | 


(s —1)sinOcos@ cos?0 + ssin?6 


Note that the rightmost operator in a product of operators is applied first. 


Rotating and Scaling for Directional Stretching 


Vector Graphics: Homogeneous Coordinates 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


In the previous section we were able to combine rotation and scaling into a 
single composite operation by matrix multiplication. Unfortunately, 
translation cannot yet be included in the composite operator since we do it 
by addition rather than by multiplication. 


Example: 

1 
Suppose we wish to rotate the image G by a about the point (—10, 10). 
Our rotation matrix R(@) always rotates about the origin, so we must 
combine three transformations to accomplish this: 


i. translate the point (—10, 10) to the origin; 


ii. rotate = radians about the origin; and 
iii. translate the origin back to (—10, 10). 


10 


ue and 


For step (i), we have bp = 
Equation: 
Gi =G+bpl’. 


For step (ii), 
Equation: 


G@ = R(F)G 
= R(}) [6+ bol”) 
R(Z)G+R(¥)bol’. 


For step (iii), we can use — bg from step (i): 
Equation: 
Grew = Go = bol? 
= R(2)G+R(2)bol? — bol? 
R(z)G+ ((R(F) — Tbe]. 


In this example we were unable to find a single matrix operator A to do the 
entire job. The total transformation took the form 


Equation: 


Grew = AG +bl1?. 


This is called an affine transformation because it involves both 
multiplication by A and addition of a constant matrix. This is in contrast to 
the more desirable linear transformation, which involves only 
multiplication by A. 


We will now move toward a modified representation of the image and the 
operators by rewriting the last equation as 
Equation: 


where in the example we had A = R(2) and b = (R (=) — I) bo, 


Exercise: 


Problem: Show that, for any matrices A, B, C, D of compatible sizes, 
Equation: 


AB+CD= [AC KE 


G 
The matrix fe looks like 


Equation: 
Ly L2 Ln 
Yr Y2 Yn ; 
1 1 1 


and the points (x;, y;, 1) are called homogeneous coordinates. We can 
modify Equation 5 so that the new point matrix is also in homogeneous 


coordinates: 
Gaaw _ A b] [G 
17 | JOF 1] fat 


Equation: 
In the new representation, each point in the image has a third coordinate, 
which is always a 1. The homogeneous transformation is a 3 x 3 matrix, 


Equation: 
fe i 
Aa ; 


OF 1 


which is capable of translation, rotation, and scaling all by matrix multi- 
plication. Thus, using homogeneous coordinates, we can build composite 


transformations that include translation. 


In homogeneous coordinates, we have 


Equation: 
cos@é —siné 0 
R(6)= sinOd cosd 0 
0 0 1 
s, 0 O 
2 (S235, — O- s;0 
0 0 1 
I> Oe 4% 
Lit) = 08 ty 
001 
Example: 


The composite transformation to triple the size of an image and then move 
it 2 units to the left is 
Equation: 
1 0 0 3 0 
Ac Oe 1 0 = 503-0 
001 1 0 0 1 
On the other hand, the composite transformation to move an image 2 units 


to the left and then triple its size is 
Equation: 


In the latter case, the distance of the translation is also tripled. 


Exercise: 


Problem: 


Find a single composite transformation in homogeneous coordinates 
that rotates an image by angle @ about point (x;, y;) as in Example 1. 


Vector Graphics: Three-Dimensional Homogeneous Coordinates 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


We now consider the storage and manipulation of three-dimensional 
objects. We will continue to use homogeneous coordinates so that 
translation can be included in composite operators. Homogeneous 
coordinates in three dimensions will also allow us to do perspective 
projections so that we can view a three-dimensional object from any point 
in space. 


Image Representation. The three-dimensional form of the point matrix in 
homogeneous coordinates is 


Equation: 
“L1, LQ £3 Ln 
G- Yl Y2 Y3 ++ Yn xn. 
Zl 22 23 we Bn 
1 1 1 1 


The line matrix H is exactly as before, pointing to pairs of columns in G to 
connect with lines. Any other grouping matrices for objects, etc., are also 
unchanged. 


Image manipulations are done by a4 x 4 matrix A. To ensure that the 
fourth coordinate remains a 1, the operator A must have the structure 
Equation: 


The new image has point matrix 
Equation: 


Gnew = AG. 


Exercise: 


Problem: 


If the coordinates of the i“ point in G are (x;, yi, zi, 1), what are the 
coordinates of the i*® point in Gpew = AG when A is as given in 
Equation 2? 


Exercise: 


Problem: 


Write down the point matrix G for the unit cube (the cube with sides 
of length 1, with one corner at the origin and extending in the positive 
direction along each axis). Draw a sketch of the cube, numbering the 
vertices according to their order in your point matrix. Now write down 
the line matrix H to complete the representation of the cube. 


Left- and Right-Handed Coordinate Systems. In this book we work 
exclusively with right-handed coordinate systems. However, it is worth 
pointing out that there are two ways to arrange the axes in three dimensions. 
Figure 1(a) shows the usual right-handed coordinates, and the left-handed 
variation is shown in Figure 1(b). All possible choices of labels x, y, and z 
for the three coordinate axes can be rotated to fit one of these two figures, 
but no rotation will go from one to the other. 


Z y 


Three-Dimensional Coordinate Systems; (a) 
Right-handed, and (b) Left-handed 


Be careful to sketch a right-handed coordinate system when you are solving 
problems in this chapter. Some answers will not be the same for a left- 
handed system. 


Image Transformation. Three-dimensional operations are a little more 
complicated than their two-dimensional counterparts. For scaling and 
translation we now have three independent directions, so we generalize the 
operators of Equation 10 from "Vector Graphics: Homogeneous 
Coordinates" as 


Equation: 
s, 0 O O 
0 s, O O 
S LIPYIP? = 
(825 Sy» 82) 0 0 s, 0 
0 0 0O 1 
I 0 0 
010t 
LT (ta, ty tz) = : 
(ta, ty, te) 001 t, 
000 1 


Exercise: 


Problem: 


Show that T(—t,, —t,, —t,) is the inverse of T (tz, ty,¢,). T~! 
undoes the work of T. 


Rotation may be done about any arbitrary line in three dimensions. We will 
build up to the general case by first presenting the operators that rotate 
about the three coordinate axes. R, (0) rotates by angle 0 about the x-axis, 
with positive 6 going from the y-axis to the z-axis, as shown in Figure 2. In 
a similar fashion, positive rotation about the y-axis using R, (@) goes from 
z to x, and positive rotation about the z-axis goes from x to y, just as in two 
dimensions. We have the fundamental rotations 

Equation: 


0 0 

cosO —siné 
sind cosé 
0 0 
cosOé 0 sind 
0 1 0 
—sind 0 cosé 
0 0 0 


cos@ —sinf 0 


oo oO fF 


sind cosOd 0 
0 0 1 
0 0 0 


Foo” OFrFoaocqgcorRaoaooo 


A more general rotation about any line through the origin can be 
constructed by composition of the three fundamental rotations. Finally, by 
composing translation with the fundamental rotations, we can build an 
operator that rotates about any arbitrary line in space. 


Directions of Positive Rotation 


Example: 
To rotate by angle y about the line &, which lies in the x — y plane in 
Figure 3, we would 


i. rotate Y to the x-axis with R, (—9); 


ii. rotate by y about the x-axis with R, (y); and 
iii. rotate back to Y with R, (6). 


The composite operation would be 
Equation: 


Composition of Rotations 


Direction Cosines. As discussed in the chapter on Linear Algebra, a vector 
uv may be specified by its coordinates (x, y, z) or by its length and direction. 
The length of v is ||v||, and the direction can be specified in terms of the 
three direction cosines ( cos 0, cos 0,, cos @,). The angle @,, is measured 
between the vector v and the x-axis or, equivalently, between the vector v 
and the vector e, =[100]”. We have 

Equation: 


cos 6 =< ee = 
~ |Ivillles [Iv 


Likewise, 0, is measured between v and e, = (010]", and @, is measured 
between v and e, = [001]”. Thus 
Equation: 


cos 0,, = 
y 
Iv 
z 
cos 0, = TW 
The vector 
Equation: 
cos 0, 
u= cos 0, 
cos 6, 


is a unit vector in the direction of v, so we have 


Equation: 
cos 6, 
v= ||v|lu= ||v\|_ cos 4, 
cos 0, 
Exercise: 
Problem: Show that wu is a unit vector (i.e. 1 u|| = 1). 


The direction cosines are useful for specifying a line in space. Instead of 
giving two points P; and P2 on the line, we can give one point P; plus the 
direction cosines of any vector that points along the line. One such vector is 
the vector from P; to Po. 


z 
Arc tangents. Consider a vector v = { } in two dimensions. We know 
y 


that 
Equation: 


ines 
x 


for the angle 6 shown in Figure 4. If we know z and y, we can find @ using 
the arc tangent function 
Equation: 


Tangent and Arc Tangent 


In MATLAB, 


theta = atan(y/x) 


—F 1 
Unfortunately, the arc tangent always gives answers between a. and rm 


corresponding to points v in quadrants I and IV. The problem is that the 
ratio — is the same as the ratio — — —aA so quadrant III cannot be 


—az£ 
distinguished from quadrant I by the ratio 4 Likewise, quadrants II and IV 
are indistinguishable. 


The solution is the two-argument arc tangent function. In MATLAB, 


theta = atan2(y,x) 


will give the true angle 9 —7z and 7z in any of the four quadrants. 


Example: 
cos6,, 

Consider the direction vectoru = cos@, , as shown in Figure 5. What 
cos0, 


is the angle y,, between the projection of wu into the x — y plane and the y- 
axis? This is important because it is R, (y,) that will put wu in the y — z 
plane, and we need to know the angle y,, in order to carry out this rotation. 
First note that it is not the same as 6,. From the geometry of the figure, we 
can write 

Equation: 


This gives us a formula for y, in terms of the direction cosines of u. With 


the two-argument arc tangent, we can write 
Equation: 


y, =tan~* (cos 0,, cos 0,). 


Angles in Three Dimensions 


Exercise: 


Problem: 


a. Suppose point p/ is in the y — z plane in three dimensions, 

p’ = (0,y’,” , 1). Find 6 so that R, (0) will rotate p/ to the 
positive z-axis. (Hint: Use the two-argument arc tangent. 8 will be 
a function of y/ and z’.) 

b. Let p be any point in three-dimensional space, p = (2, y, z, 1). 
Find ~w so that R., (w) rotates p into the y — z plane. (Hint: Sketch 
the situation in three-dimensions, then sketch a two-dimensional 
view looking down at the x — y plane from the positive z-axis. 
Compare with Example 2.) Your answers to parts (a) and (b) can 
be composed into an operator Z(p) that rotates a given point p to 
the positive z-axis, Z(p) = Rz (@)R,z (y). 

c. Let ¥ be a line in three-dimensional space specified by a point 1 
=(x, y, Z, 1) and the direction cosines ( cos 0,, cos 6,, cos 0,). 
Use the following procedure to derive a composite operator 
R(y, &) that rotates by angle g about the line 7: 


i. translate 1 to the origin; 
ii. let u =( cos 0z, cos 6,, cos 8,, 1) and use Z(u) to align Y 
with the z-axis; 


ili. rotate by y about the z-axis; 
iv. undo step (ii); and 
v. undo step (i). 


Vector Graphics: Projections 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Computer screens and printers are two-dimensional display devices. We 
must somehow convert three-dimensional images to two dimensions in 
order to display them. This task is done by another kind of matrix operator 
called a projection. 


To build a projection, we first choose a projection plane in the three- 
dimensional space of the object we wish to view. All points of the object are 
then projected onto the plane. There are many different kinds of projections, 
corresponding to various geometric rules for mapping points in space onto a 
plane. We begin with the parallel projection illustrated in Figure 1, wherein 
the dotted lines between the points and their projections in the plane are all 
parallel to one another. These dotted lines are called projectors. 


Range. The projection plane is called the range of the projection. We will 
assume for now that the projection plane passes through the origin. If it 
does not, we may later compose the necessary translations with our 
projection. Three points, not all in a line, are required to determine a plane. 
We will take the origin as one of the points and suppose that the other two 
points are the vectors r; and rg. From these vectors, we form the matrix 
Equation: 


R = [rire], 


which determines the range of the projection. 


Oblique Parallel Projection 


Null-Space. To complete the specification of the parallel projection, we 
must specify the direction of the projectors. The line through the origin in 
this direction is called the null-space of the projection. Note that any line in 
the three-dimensional object that is parallel to the null-space will disappear 
to a point when projected. The null-space may be specified by a vector n 
pointing in the direction of the projectors. 


Orthogonal Projection. If the null-space is perpendicular to the range, we 
say that the projection is an orthogonal projection. The matrix operator for 
orthogonal projection is a function of the range. If we assume that F is 
specified in three-dimensional coordinates (not homogeneous! ), we have 
the following definition for the orthogonal projection onto the range of R: 


Equation: 


P(R) =R(RTR) ‘RT cB. 


In homogeneous coordinates, we embed this 3 x 3 projection matrix in the 
general 4 x 4 transformation: 
Equation: 


Example: 
Suppose we choose the xz — y plane as our projection plane. Vectors r; and 


72 can be any two points in the plane (besides the origin), so let's take 
Equation: 


Then the range matrix is 
Equation: 
i ak 
Re, Ope 
0 0 
The orthogonal projection looking down on the x — y plane from the z- 


axis is 
Equation: 


ne 1 0 0 aoe 1 0 0 
Be) as Fs al © 2 fee 
0 0 0 0 
1 0 O 
— 0 1 0 
0 0 0 
In homogeneous coordinates, 
Equation: 
1 0 0 O 
0 1 0 0 
es) . 
B= 9 900 
000 1 
Exercise: 
Problem: 
2 0 
Let the vectorsr; = O andrg= 3 specify the range of an 
0 0 


orthogonal projection. Find P(R) and P, (R). Compare with Example 
1 and explain any similarities. 


Example: 

In the chapter on linear algebra you learned that the projection of w onto x 
is given by 

Equation: 


(x, w)x 


“x, x) 


This is an orthogonal projection of w onto a line, but it is closely related to 
the projection into the plane just described. To see the similarity, let's work 
on the expression for z a little: 

Equation: 


| 
ms 
——™" 
a 
es 
a 
SS 
| 
aS 
a 
Ny 
q 
NS 


| 

a 
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a 
A 

a 
wn” 
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a 
= 


| 
J 
a 

E 


So we see that it can be written in the same way as the projection onto the 
plane. The only difference is that the range is now one-dimensional and 
specified by z in place of R. 


Oblique Projection. More generally, we may have a null-space 7 that is 
not perpendicular to the range R. The projection shown in Figure 1 is an 
oblique projection. Once again we start with nonhomogeneous coordinates 
for n and R and write the 3 x 3 oblique projection as 

Equation: 


E(R,n) = R{R? [J — —P(n)|R} ‘Rt [I — —P(n)| 


where 
Equation: 


As with the orthogonal projection, we can return to homogeneous 
coordinates by 


Equation: 
E(R n) O 
E, (R,n) = 
a (Rn) ie | 
Exercise: 
Problem: 


Prove and interpret the following properties of parallel projections 
(both orthogonal and oblique): 


a, P? = PE? = Ee. 

bPR=R; BR = Rk. 

c. En = 0; Pn = 0. (First show that R’n = 0 when n is 
orthogonal to FR.) 


Display from Projections. Even after we have used a projection, the image 
points are in three-dimensional homogeneous coordinates. How then do we 
display them on a two-dimensional display? If we had chosen the xz — y 
plane as the range of our projection, we could let the output device 
represent the z — y plane and ignore the z-coordinate of each point. The z- 
coordinates of the projected image would all be 0 anyway since the 
projected points would lie in the x — y plane. The fourth coordinate of 1 
may also be ignored for display. But even if the projection plane is not the 
x — y plane, we may perform a rotation to align the two planes and again 
let the output device represent the x — y plane. 


Perspective Projections. To obtain a perspective projection, we first 
choose a projection plane as we did for parallel projections. Instead of 
choosing a null-space parallel to all projectors, we now choose a viewpoint. 
The projectors all pass through the viewpoint in a perspective projection, as 
shown in Figure 2. For a viewpoint at z = d on the z-axis and a projection 
plane coinciding with the x — y plane, the three-dimensional homogeneous 
operator for perspective projection is 


Equation: 


oOo Oo Oo fF 
Co oO fF & 
re OO O 


The first thing you should notice about the perspective projection M(d) is 
that it violates the structure given in Equation 2 from "Vector Graphics: 
Three-Dimensional Homogeneous Coordinates" by having the 4,3 position 


equal to — q rather than to 0. This means that the fourth coordinates will 


not remain 1 in the new point matrix 
Equation: 


Grew = M(d)G. 


A Perspective Projection 


This situation is interpreted to mean that the new point (a, y, z, w) must be 
x z ; : 

renormalized to (=, ae =1) before the operation is considered complete. 
w ww 


Renormalization adds another computational step to the image 
transformation process. This is undesirable, but we are stuck with it if we 
wish to include perspective projections in our repertoire of transformations. 
Note that renormalization is a point-by-point process since w may be 
different for each point. 


How does renormalization affect composition of operators? We might 
expect that, each time we perform an operation requiring renormalization, 
we will be forced to stop and do the renormalization before going on with 
other operations. In this respect we are fortunate: we may put off the 
renormalization until all transformations are complete and renormalize once 
just before displaying the image. Thus we are able to compose perspective 
projections at will with other transformations. 

Exercise: 


Problem: 


What is the perspective projection matrix for a viewpoint at infinity, 
M(oo)? Interpret the result. 


Generalization of Projections. “Projection” is a technical term in linear 
algebra. A square matrix of any size is considered a projection if it obeys 
the property of Exercise 2(a), P? = P. All of our geometric projection 
matrices have this property. We have considered only three-dimensional 
spaces with a two-dimensional range and a one-dimensional null-space. In 
general, the dimensionality of the space may be split in any way between 
range and null-space. If we reinterpret homogeneous coordinates as four- 
dimensional space, we have projections with three-dimensional range and 
one-dimensional null-space, but the perspective projection is actually not 
quite a projection in the technical sense even though M? (d) = M (d) 
because perspective projection also includes renormalization. 


Vector Graphics: Numerical Experiment (Star Field) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


With Earth at the origin, we may specify the star positions for the Big and 
Little Dippers in three-dimensional homogeneous coordinates. With light 
years as our units, we have G? = 


1.5441 -1.2064 153.0875 1.0000 
-1.0386 8.4588 142.7458 1.0000 
=@..7955 26.2870 198.0698 1.0000 
-12.8807 19.0964 106.5383 1.0000 
-18.8926 17.461 90.6185 1.0000 
-45.1364 54.2706 215.1148 1.0000 
+9 ,.6222 20.1734 88.0062 1.0000 
-33.7097 -8.4048 64.6574 1.0000 
-33.7144 +0; 7191 52.3806 1.0000 
-43.8531 -1.1483 59.7193 1.0000 
-36.1086 2.3667 55,7927 1.0000 
- 34.2309 8.2181 52.1260 1.0000 
-30.7876 11.8182 46.9810 1.0000 
-61.8581 31.5183 80.7615 1.0000 


To make use of this data, we need a function to display it. Enter and save 
the following generalization of the function from Demo 1 in "Vector 
Graphics: Two Dimensional Image Representation". Call it vnagraph.m. 


function vhgraph (P,L,A,ps,ls); 
% vhgraph(P,L,A,PS,LS) graphs images whose points 
are 


% stored in P and whose lines are stored in L. The 
points 
% in P must be in homogeneous coordinates in eithe 
r 2 or 3 
% dimensions, with each column of P representing a 
point. 
% The lines are coded in L, with each column of L 
containing 
% 2 integers pointing to a pair of points in P to 
be 
% connected by a line segment. If A is present, th 
e points 
% in P are transformed to A*P beforre graphing. Fo 
r 3D data 
% points, only the first two coordinates are graph 
ed, SOA 
% should include the desired projection from 3D to 
2D. The 
% point symbol may be specified in PS and the line 
type in 
% LS, if desired 
% 
% Richard T. Behrens, October 1989. 
% 
% The first dection of the program determines the 
sizes of 
% all the input matrices and chacks that they make 
sense. 
[mp,np] = size(P); 
if (nargin > 1) 
[ml,nl] = size(L); 
else 
ml = 2; nl = 0; 
end 
if (nargin <=2) 
A = eye(mp); 
end 


[mA,nA] = size(A); 
if ((mp~= nA) | (ml~=2)) 
error('Incompatible sizes of input matrices.") 
end 
if (nargin <= 3); 


ps = '*'; 
end 
P = A*P; % Performs the transformation A on the 
points 


% (effect is only local to this functi 
on). 


% The next section contains a loop that renormaliz 
es the 
% homogeneous coordinates of the points if necessa 
ry. 
renorm = find((P(mA,:)~= 1)); 
if~isempty(renorm) 

for 1 = 1:length(renorm) 

P(:,renorm(i)) = P(:,renorm(1))/P(mA, renor 

m(i)); 


end 
end 


% The next program line sets a fixed scale output 

window 

% from -50 to 50 in both x and y directions on the 
screen. 

% For automatic scaling to include all points of t 
he 

% image, we could use instead the line q = min(min 
(P)); 

% r = max(max(P)); 

q = -50; r = 50; 

axis([q r q r]) 

axis('square' ) 


plot(P(1,:),P(2,:),ps) % Plots the points with sy 


mbol ps. 

hold on % Saves the points while w 
e plot 

for 1 = i:nl % lines with line type LS. 


plot([P(1,L(1,1))P(1,L(2,1))],.. 
[P(2,L(4,1) )P(2,L(2,1))],1s) 

end 

hold off 


Enter the point matrix given at rhe beginning of this section (and take its 
transpose to put it in the usual form). Also enter the line matrix from Demo 
1 in "Vector Graphics: Two Dimensional Image Representation". Save these 
two matrices and try looking at the image 


> save dip3d 
>> vhgraph(G,H) 


No dippers in sight, right? Without specifying a transformation matrix A, 
we have defaulted to looking down on the x — y plane from z = oo (a 
parallel projection). This is how the constellations would look from a 
distant galaxy (say, a billion light years north of here) through an enormous 
telescope. We need a perspective view from the origin (Earth), but first we 
need a set of functions to give us the fundamental operators with which we 
can build the desired projection. 


Take R, (6) as an example. The function to build it looks like 


function Ry = vhry (theta) ; 
% Rotation matrix about y-axis for 3-D homogeneous 
% coordinates. 

Ry = eye(4),; 

Ry(1,1) = cos(theta); 


Ry(3,3) = cos(theta); 
Ry(3,1) = -sin(theta); 
Ry(1,3) = sin(theta); 


Enter and save hry .m as given. Then write functions for 


Equation: 


R, (0) vhre. m 
R.(0) vhrz. m 
S(Sz, $y, $8z) vhs. m 
T(tz, ty, tz) vht. m 
M(d) vhm. m 


Useful MATLAB functions for this task include zeros, eye, and diag. 


Now build and use a perspective projection with viewpoint Earth and 
projection plane at z = 1000 behind the dippers: 


1. translate Earth to z = —1000 so that the projection plane coincides 
with the x — y plane: T(O, 0, —1000); 


2. use the fundamental perspective projection: M(—1000); and 
3. translate back: T(O, 0, 1000). 


> A = vht(0,0,1000) * vhm(-1000) * vht(0,0, -1000) 
>> vhgraph(G,H,A) 


Oops! Now the image is too big; it's mostly off the screen. Scale it down 
and have another look: 


> A = vhs(.06,.06,.06) * A 
>> vhgraph(G,H,A) 


Now the view should look familiar. Leave A as it is now: 
> save dip3d 
Experiment with scale and rotation about the z-axis. For example, try 


> vhgraph(G,H,vhrz(pi/2) * A) 


The two-dimensional star positions given in Demo 1 in "Vector Graphics: 
Two Dimensional Image Representation" were obtained from the three- 
dimensional positions with the composite operator A you are now using. To 
compare the two, type 


> Gnew =A * G 


> for i= 1:14 
Gnew(;,1) = Gnew(:,1)/Gnew(4,1); 
end 


>> Gnew 


and compare the x and y coordinates with those of Demo 1 in "Vector 
Graphics: Two Dimensional Image Representation". 


Astronomers give star positions in equatorial coordinates using right 
ascension, declination, and distance. The following function converts 
equatorial coordinates, which are spherical, to Cartesian coordinates with 
the z-axis pointing north, the x-axis pointing at the vernal (Spring) equinox 
in the constellation Pisces, and the y-axis pointing toward the Winter 
solstice in the constellation Opheuchus. 


function v = starxyz(rah, ram, decd, decm, dist) 

% starxyz is the cartesian coordinates of a star w 
hose 

% Spherical coordinates ( e.g. from a star catalog 
) are 

% 


% rah right ascension hours 

% ram right ascension minutes 

% decd declination degrees 

% decm declination minutes (should be 
negative 

% if decd is negative) 


% dist distance (light years) 
% 


phi = (pi/180) * (decd + decm/60); 
theta = (pi/i2) * (rah + ram/60); 


r = dist; 
v = [r * cos(phi) * cos(theta); - 
r * cos(phi) * sin(theta); 


r * sin(phi)]; 


Have you ever wondered what the constellations would look like from other 
places in the galaxy? We will soon see the answer. First we will view the 
dippers from Alpha Centauri, the nearest star, whose coordinates are 


-1.5680 1.3157 -3.6675. 


We will look toward the centroid of the fourteen stars in the dippers, located 
at 


-26.3632 12.8709 100.4714. 
To get the desired view, we must 


1. translate the viewpoint to the origin; 

2. rotate the centroid (direction to look) to the z-axis—note that the 
centroid will have new coordinates after step (1); and 

3. apply the composite 
A=S(2706, .06, 206) *1(0;, ©; 1000)" M(-1000) “1(0, 6; -10 
00) (as used to view from Earth). 


Write a function vhz .m based on Exercise 5 from "Vector Graphics: Three- 
Dimensional Homogeneous Coordinates" to accomplish step (2). Test it on 
several random points to make sure it works right. Now write a general 
perspective projection function called vaview.m. The function vaview 
should accept as inputs two vectors, the first specifying the viewpoint and 
the second the point to look toward. Its output should be a composite 
operator that performs all three of the preceding steps. 


Now we want to look toward the centroid of the dippers from Alpha 
Centauri. To do so, enter the vectors for the two points of interest and 
construct the view like this: 


> acentaurli = [-1.5680; 1.3157; -3.6675] 
> centroid = [-26.3632; 12.8709; 100.4714] 
> A = vhview(acentauri, centroid) 

>> vhgraph(G,H,A) 

> title('From Alpha Centauri' ) 


The farther we move from Earth, the more distorted the dippers will look in 
general. It should be easy now to view them from any desired location. Just 
choose a viewpoint, recalculate the composite operator for that viewpoint 
using vhview, and use vhgrapnh. Follow this procedure to view the 
dippers from each of the stars in the following list. You will need to use 
starxyZ first to convert their coordinates. 


Right Distance 
Star Ascension Declination (ly) 
oe 14h 40m 60° 50' 4.2 
Centauri 
Sirius 6h 45m -16°43' o25 
Arcturus 14h 16m 19°11' 16.6 
Pollux 7h 45m 28°02" aol 
Betelgeuse 5h 55m 7°24' SHEE Ro) 


Table 1 is a six-row and four-column table with the first row as a header, 
the first column describing different stars, and the values in the second 
through fourth columns describing the corresponding right ascension, 
declination, and distance for each star. 


Of course, star viewing is not the only application of vector graphics. Do 
some experiments with the unit cube (see Exercise 2 from "Vector 
Graphics: Three-Dimensional Homogeneous Coordinates"). View the cube 
from location (4,3,2) looking toward the origin using the procedure just 
outlined for stars. You may need to adjust the scaling to get a meaningful 
view. 
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Notes to Teachers and Students: 


Filtering is one of the most important things that electrical and computer engineers do. In this chapter 
we extend everyday understanding of filters to numerical filters. We then study weighted moving 
averages and exponential averages. We define the important test signals for electrical and computer 
engineering and show how filters respond to them. The idea that filters are characterized by their 
response to simple test signals is fundamental. In the numerical experiment, students explore the 
frequency response of a simple filter, a concept that forms the basis of circuit theory, electronics, optics 
and lasers, solid-state devices, communications, and control. 


Introduction 


A filter is any device that passes material, light, sound, current, velocity, or information according to 
some rule of selectivity. Material (or mechanical) filters are commonplace in your everyday life: 


i. coffee filters pass flavored water while filtering out coffee grounds; 

ii. Goretex fibers pass small, warm perspiration droplets while filtering out large, cool droplets of 
rain or snow; 

iii. fiberglass strands in a furnace filter pass warm air while filtering out particles of dirt and dust; 

iv. a centrifuge retains material of low density while spinning out (or filtering out) material of high 
density; and 

v. an electrostatic precipitator filters out dust and other effluents by attaching charge to them and 
using an electric field to move the charged particles to a high potential drain. 


The first three of these examples selectively pass material according to size; the last two selectively 
pass material according to its mass density. 


Typical filters for light are 


i. UV filters on camera lenses and eyeglasses that pass light in the range of visible wavelengths 
while blocking light in the invisible (but damaging) ultraviolet range; 
ii. polaroid lenses that pass light that is randomly polarized while blocking out glare that is linearly 
polarized; 
iii. green fabrics that reflect green light and absorb other colors; 
iv. red taillights that pass light in the long wavelength red range and reflect light in the short 
wavelength violet range (look at the inside of your taillights to see violet); and 
v. glacial ice that absorbs all but the blue wavelengths so that it appears blue. 


Exercise: 


Problem: List as many examples of natural and man-made sound filters as you can. 


Satellite Television. Among current filters, the tuner in a super-heterodyne receiver is, perhaps, the 
first example that comes to mind. But satellite TV filters are another fascinating example. A typical C- 
band satellite has twelve transponders (or repeaters), each of which transmits microwave radiation in a 
personalized 36 MHz band. (The abbreviation MHz stands for megahertz, or 10° Hz, or 10° cycles per 
second. Other common abbreviations are Hz for 1 Hz, kHz for 10? Hz, and GHz for 10° Hz.) Each 
transponder actually transmits two channels of information, one vertically polarized and one 
horizontally polarized. There is an 8 MHz guard band between each band, and the vertical and 
horizontal channels are offset by 20 MHz. The transmission scheme for the 24 channels is illustrated in 
Figure 1. The entire transmission band extends over 540 MHz, from 3.7 x 10° Hz to 4.24 x 10° Hz. 
The satellite receiver has two different microwave detectors, one for vertical and one for horizontal 
polarization, and a microwave tuner to tune into the microwave band of interest. 

Exercise: 


Problem: Check that the transmission scheme of Figure 1 consumes 540 MHz of bandwidth. 


Exercise: 


Problem: 


List as many examples of natural and man-made devices for velocity filtering as you can. 


MH 
36 MHz 
Vi, «es we 
37x10 9 
8 MHz 


20 MHz 36 MHZ bre 
<-> +> 
, A, —_ H, = >2 2 


Satellite TV— Vj, Vertically Polarized Channel 7; H,;, 
Horizontally Polarized Channel j 


An Aside on Hertz and Seconds. The abbreviation Hz stands for hertz, or cycles/second. It is used to 
describe the frequency of a sinusoidal signal. For example, house current is 60 Hz, meaning that it has 
60 cycles each second. The inverse of Hz is seconds or, more precisely, seconds/cycle, the period of 1 
cycle. For example, the period of 1 cycle for house current is 1/60 second. When we are dealing with 
sound, electricity, and electromagnetic radiation, we need a concise language for dealing with signals 
and waves whose frequencies range from 0 Hz (called DC or direct current) to 10'8 Hz (visible light). 
Table 1 summarizes the terms and symbols used to describe the frequency and period of signals that 
range in frequency from 0 Hz to 10!? Hz. 


Frequency Period 
Hz Term Units Seconds Term Units Example 


battery current: 


Hz hertz 1 Hz sec second 1 sec om 
house current: 
60 Hz 
3 3 ‘ 
kHz kilohertz a msec millisecond = zudirequency 
Hz sec sound 
108 clock 
MHz megahertz Hz psec microsecond 10sec frequencies in 
microcomputers 
microwave 
9 . . 
GHz gigahertz 1 nsec nanosecond 10sec acienoy sa 
Hz satellite 
communication 
12 -12 : 
THz terahertz mn psec picosecond ») pirated 
Hz sec radiation 


Terms and symbols for Sinusoidal Signals 


Numerical Filters. Rather amazingly, these ideas extend to the domain of numerical filters, the topic of 
this chapter. Numerical filters are just schemes for weighting and summing strings of numbers. Stock 
prices are typically averaged with numerical filters. The curves in Figure 2 illustrate the daily closing 
average for Kellogg's common stock and two moving averages. The 50-day moving average is obtained 
by passing the daily closing average through a numerical filter that averages the most current 50 days' 
worth of closing averages. The 200-day moving average for the stock price is obtained by passing the 
daily closing prices through a numerical filter that averages the most current 200 days' worth of daily 
closing averages. The daily closing averages show fine-grained variation but tend to conceal trends. 
The 50-day and 200-day averages show less fine-grained variation but give a clearer picture of trends. 
In fact, this is one of the key ideas in numerical filtering: by selecting our method of averaging, we can 
filter out fine-grained variations and pass long-term trends (or vice versa), or we can filter out periodic 
variations and pass nonperiodic variations (or vice versa). Figure 2 illustrates that moving averages 
typically lag increasing sequences of numbers and lead decreasing sequences. Can you explain why? 


We will call any algorithm or procedure for transforming one set of numbers into another set of 
numbers a numerical filter or digital filter. Digital filters, consisting of memories and arithmetic logic 
units (ALUs), are implemented in VLSI circuits and used for communication, control, and 
instrumentation. They are also implemented in random-—or semicustom-—logic circuits and in 
programmable microcomputer systems. The inputs to a digital filter are typically electronic 
measurements that are produced by A/D (analog-to-digital) conversion of the output of an electrical or 
mechanical sensor. The outputs of the filter are “processed,” “filtered,” or “smoothed” versions of the 
measurements. In your more advanced courses in electrical and computer engineering you will study 
signal processing and system theory, assembly language programming, microprocessor system 


development, and computer design. In these courses you will study the design and programming of 
hardware that may be used for digital filtering. 
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Dow-Jones Averages (Adapted from the New York Stock Exchange, Daily Graphs, 
William O'Neil and Co., Inc., Los Angeles, California) 
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The simplest numerical filter is the simple averaging filter. This filter is 
defined by the equation 
Equation: 


The filter output x is the average of the N filter inputs w1, ua, ... , Uy. These 
inputs may be real or complex numbers, and x may be real or complex. This 
simple averaging filter is illustrated in Figure 1. 


A Simple Averaging Filter 


Example: 


If the averaging filter is excited by the constant sequence 
tu] = Ug = --- = uy = u, then the output is 
Equation: 


1 xX 
a 


The output is, truly, the average of the inputs. Now suppose the filter is 
excited by the linearly increasing sequence 
Equation: 


Lee lee 


This sequence is plotted in Figure 2. How do we sum such a sequence in 
order to produce the average x? For NV even, the average may be written as 
Equation: 


il 
B= ae + ay) 


1 1 
N salle, <b eaici)) <= eee SE ay (ene + Bv/2)41)- 


N 


N 
Each pair-sum in parentheses equals NV + 1, and there are = such pair- 
sums, so the average is 
Equation: 


1 N IN Se Il 


4 = 


This is certainly a reasonable answer for the average of a linearly 
increasing sequence. See Figure 2. 


Ie 2 Oo. OS Oe 8 


(NV) 


Linearly Increasing Sequence 


Exercise: 


Problem: 


1X 
Write x = W ye nm as asum of pair-sums for NV odd. What does x 
n=1 


equal? 


General Sum Formula. Suppose the input to the simple averaging filter is 
the polynomial sequence 
Equation: 


where k is a non-negative integer such as k = 0,1, 2,.... The output of the 
filter is 
Equation: 


eae 
zy =m. 


n=1 


We rewrite x as ao) to remind ourselves that we are averaging NV numbers, 


each of which is n*. For example, when N = 8 and k = 2, 
Equation: 


8 
2) 1 2 
tet See ee eae 04) 
Rather than study the average a), we will study the sum NV a) and divide 
by WN at the very end: 
Equation: 


N 
5 = Na® = Son. 


k ; 
The sum S' “) may be rewritten as the sum 
Equation: 


s&) = Se aN 
= SY) +N*. 


This result is very important because it tells us that the sum sv. viewed as 


a function of N, obeys a recursion in which S$ «) is just the sum using one 


less input, namely, S hoe plus N*. Now, since polynomials are the most 


general functions that obey such recursions, we know that 3) must be a 
polynomial of order & + 1 in the variable N: 


Equation: 


s\) =aj+a,N+a,N2+--: eNO 


Let's check to see that this polynomial really can obey the required 
recursion. First note that S' ce is the following polynomial: 
Equation: 


Be ap aN = 1) eee (NS 1). 


The term (NV — 1)*** produces (5+) eH (_1)° a (#1) wF(-1)' Aes 
. (Remember the binomial expansion?) Therefore the difference between 
S® ands is 

N N-1 
Equation: 


SO) Seg taNt---+oN*. 


This recursion is general enough to produce the difference Nk provided we 
can solve for ao, @1,... , @g41 to make cg = Cy = +--+ = Ce_1 = O and 


Ck — 1. We know that sy = 0 for N = 0, so we know that ap = 0, 


meaning that the polynomial for S i ) can really be written as 
Equation: 


s&) = a,N +a,N? te ok et Oe 


In order to solve for the coefficients of this polynomial, we propose to write 
k 
out our equation for 3) as follows: 


Equation: 


(N=1) SW) = ree ee ae 

(NV _ 2) sf) = 2ayt+ere+ OF peg 
(N=3) 5) = Te eee a 
(N=k) s = kay +--- +key 


(N=k+1) SO = (R+1)ar+---+ (R41) apy. 


Using the linear algebra we learned earlier, we may write these equations as 
the matrix equation 


Equation: 
1 in g\*) 
2 4 eee gk+1 ag sh) 
k ke zn OO 
(kK+1) (k+1)? .. (k+1)**"] Loe g) 


The terms on the right-hand side of the equal sign are “initial conditions” 
that tell us how the sum S' @) begins for N = 1, 2,...,4 + 1. These initial 


conditions must be computed directly. (For example, sh) =" 429") 
Then the linear system of (kK + 1) equations in (&K + 1) unknowns may be 
solved for a1, @2,...,@x%41. The solution for S Ms is then complete, and we 
may use it to solve for S i“ for arbitrary NV. 


Example: 
When k = 2, we have the following equation for the coefficients aj, ao, 


and a3 in the polynomial so) = a)N aN as - 


Equation: 


lees Whey ite 1 
24 8 | la] = 22422? | = 
3.9 27] las 1? 492-5 32 14 


Exercise: 


Problem: 


Solve for a1, @2, @3 in the linear equation of Example 2. Show that 
se) =a,N + aN? + a3N°® obeys the recursion 
Sy =5) 4S. 

Exercise: 


Problem: 


(MATLAB) Write a MATLAB program to determine the coefficients 


) 


Qj, 42,...,@%41 for the polynomial S$ . . Generate a table of formulas 


for the averages 2) for k = 1, 2,...,5. Evaluate these formulas for 


N = 2,A, 8, and 16. 


Exponential Sums. When the input to an averaging filter is the sequence 
Equation: 


in,=a 5 W011, 2,641) 1, 


we Say that the input is exponential (or geometric). Typical sequences are 
illustrated in Figure 6.5 fora = 0.9,a@ = 1, anda = 1.1. Don't let it throw 
you that we have changed the index to run from 0 to NV — 1 rather than 
from 1 to N. This change is not fundamentally important, but it simplifies 
our study. The sum of the inputs is 

Equation: 


OP versus) * 


Exponential Sequences 


How do we evaluate this sum? Well, we note that the sum aS'y is 
Equation: 


aSn = Yn ae = De 1 ae 


Therefore, provided a # 1, the sum S'y is 
Equation: 


Lae 


Sv = 


l-a 


This formula, discovered already in the chapter covering the functions e* 
and e/”, works for a ~ 1. When a = 1, then Sy = N: 
Equation: 


1—a 
oe Oe 
N, a=1. 


When |a| < 1, then a® — 0 for N — oo, and we have the asymptotic 
formula 


Equation: 
im Suse =, alee 
ee Ge an 
Exercise: 
Problem: 


N-1 
1 
Evaluate Sy = > a” and X vy = —Sw fora = 0.9,1, and 1.1 and 
n=0 N 


for N = 1,2,4,8, 16, and 32. 


Exercise: 
N-1 
Problem: Prove that Sy = S° a” obeys the recursion 
n=0 
Equation: 
Sn = Syn_-1 t+ “re 


Prove that S;jy = N obeys this recursion for a = 1 and that 
i= N 
= i a obeys it fora # 1. 
—@ 


Recursive Computation. Every sum of the form 
Equation: 


N-1 
Sv= ) Un 
n=—0 


obeys the recursion 
Equation: 


Sw = Sy_-1 + un-1- 
This means that when summing numbers you may “use them and discard 
them.” That is, you do not need to read them, store them, and sum them. 


You may read ug to form $4 and discard uo; add wu to Sj and discard wy ; 
add ue to Se ; and continue. 


Uo (+) 


S; 
U | (+) 
S, 
U. (+) 
D5 
U5 @ 
S4 


The Recursion S,41 = Sp + Un 


This is very important for hardware and software implementations of 
running sums. You need only store the current sum, not the measurements 
that produced it. Two illustrations of the recursion S,41 = Sy + Un are 
provided in Figure 4, The diagram on the left is self-explanatory. The 
diagram on the right says that the sum S,, is stored in a memory location, to 
be added to u, to produce S;,,,;, which is then stored back in the memory 
location to be added to u,,41, and so on. 
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Weighted, tapered, or windowed averages are straightforward 
generalizations of simple averages. They take the form 
Equation: 


with the constraint that the “weights in the window,” w,,, sum to 1: 
Equation: 


N 
So wn = 1, 
n=1 


1 
When w, = W then z is the simple average studied in the section on 


"Simple Averages". 


Example: 

There are many windows that are commonly used in engineering practice. 
For N odd, the standard triangular window is 

Equation: 


Wn 


2 a 2 pee 
Ps) Ze L | 


This window, illustrated in Figure 1, weights the input w(j+1)/2 by 


2 
2 
and the inputs w; and wy by sot The most general 


Neel 
triangular window takes the form 
Equation: 
N+1 
Wn =a(1— pf > —n|);a,B>0, N odd. 


Triangular Window 


Exercise: 


Problem: 


Determine the constraints on @ and ( to make the general triangular 


window a valid window (i.e., e Wn = 1). Show that 
n=1 
a= + i = ( is a valid solution. Propose another solution that you 
like. 
Exercise: 
Problem: 


You are taking three 3-credit courses, one 5-credit course, and one 2- 
credit course. Write down the weighted average for computing your 
GPA in a system that awards 4.0 points for an A, 3.0 points for a B,... 
, and (horrors!) 0 points for an F’. 
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Moving averages are generalizations of weighted averages. They are 
designed to “run along an input sequence, computing weighted averages as 
they go.” A typical moving average over N inputs takes the form 
Equation: 


N-1 
In = oa WkUn—k 


= WoUn + W1Un-1 1 +++ F WN-1Un—-(N-1): 


The most current input, w,, is weighted by wa; the next most current input, 
Un—1, 1S weighted by w 1 ; and so on. This weighting is illustrated in Figure 
1. The sequence of weights, wo through wy_ , is called a “window,” a 
“weighting sequence,” or a “filter.” In the example illustrated in Figure 6.8, 
the current value wu, is weighted more heavily than the least current value. 
This is typical (but not essential) because we usually want z,, to reflect 
more of the recent past than the distant past. 


Moving Average 


Example: 
1 
When the weights wg, wy, ..-, Wy_ , are all equal to W’ then the moving 


average Z,, is a “simple moving average”: 
Equation: 


1 
Ln = yy lun + Un-1 as ee +uy-1]. 


This is the same as the simple average, but now the simple average moves 
along the sequence of inputs, averaging the NV most current values. 


Exercise: 
Problem: 
N-1 


Evaluate the moving average 7, = Wink for the inputs 
k=0 


0; 0 0 


ah 
Us 2D; 
0;. m0 
eo — 
i, 7 > 0. 


Interpret your findings. 


Exercise: 
Problem: 
NA 
Evaluate the simple moving average 7, = —Un_, when wu, is 
ro NV 
the sequence 
Equation: 
0, n<O 
U —- 
rf a”, n>0. 


Interpret your result. 


Example: 
When the weights w, equal woa” forn = 0,1,..., NW — 1, then the 


moving average x, takes the form 
iN 


LUG @ Unk. 

k=0 
When a < 1, then up is weighted more heavily than up—(v—1); whena > 1 
» Un—(N-1) 1S weighted more heavily than u, ; when a = 1, u, is weighted 
the same as Up_(—1)- 


Exercise: 


Problem: 


Evaluate wo so that the exponential weighting sequence w,, = 


N-1 
woa” (n = 0,1,...,. N — 1) is a valid window (i.e., Wy = 1): 
n=0 
Exercise: 
Problem: 
N-1 
Compute the moving average x, = woa*un_—, when the input 
k=0 
sequence u,, 1S 
Equation: 
b”, n>O 
t= 
0, n<O0. 


What happens when 6b = a? Can you explain this? 


Filtering: Exponential Averages and Recursive Filters 
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Suppose we try to extend our method for computing finite moving averages 
to infinite moving averages of the form 
Equation: 


CO 
a yor g Week 
= WouUn + W1Un-1 + +++ + Wi9000Un—1000 ++ °° 


In general, this moving average would require infinite memory for the 
weighting coefficients wo, w1,... and for the inputs un, Un—1, -.-. 
Furthermore, the hardware for multiplying w;,u,_,; would have to be 
infinitely fast to compute the infinite moving average in finite time. All of 
this is clearly fanciful and implausible (not to mention impossible). But 
what if the weights take the exponential form 


Equation: 
0, k<0 
WwW —— 
. woa*, k>0? 


Does any simplification result? There is hope because the weighting 
sequence obeys the recursion 
Equation: 


0, k<0 
Wk= Wo, k=0 
awrp1 k> 1. 


This recursion may be rewritten as follows, for k > 1: 
Equation: 


We — awy_1 = 0, k> 1. 


Let's now manipulate the infinite moving average and use the recursion for 
the weights to see what happens. You must follow every step: 
Equation: 
Y= arn 
[e@) 
= 1 Rak Wotan 
CO 
pal WWk-1Un—k + Won 
Ay m=0 WmUn—1-m + WoUn 


ALn—-1 + WoUn- 


| 


| 


This result is fundamentally important because it says that the output of the 
infinite exponential moving average may be computed by scaling the 
previous output x,,_; by the constant a, scaling the new input wu, by wo, 
and adding. Only three memory locations must be allocated: one for wo, 
one for a, and one for £,_;. Only two multiplies must be implemented: one 
for az,—1 and one for wow. A diagram of the recursion is given in Figure 
1. In this recursion, the old value of the exponential moving average, %p_1, 
is scaled by a and added to wou, to produce the new exponential moving 
average Z,. This new value is stored in memory, where it becomes z,_; in 
the next step of the recursion, and so on. 


Dn -| 


W 


Recursive Implementation of an Exponential Moving 
Average 


Exercise: 
Problem: 
Try to extend the recursion of the previous paragraphs to the weighted 


average 
Equation: 


What goes wrong? 
Exercise: 
Problem: 
Compute the output of the exponential moving average 7, = 


ALn_1 + WoUn when the input is 
Equation: 


0, n<0 
a 
uy n>Qd. 


Plot your result versus 7. 


Exercise: 


Problem: Compute wo in the exponential weighting sequence 


Equation: 
0, n<O0 
WwW = 
. a"wo, n>O 


to make the weighting sequence a valid window. (This is a special case 
of Exercise 3 from Filtering: Moving Averages.) Assume —1<a<1 


Filtering: Test Sequences 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


When we design a filter, we design it for a purpose. For example, a moving 
average filter is often designed to pass relatively constant data while 
averaging out relatively variable data. In an effort to clarify the behavior of 
a filter, we typically analyze its response to a standard set of test signals. 
We will call the impulse, the step, and the complex exponential the standard 
test signals. 


Unit Pulse Sequence. The unit pulse sequence is the sequence 


Equation: 
1, n=0 
n— On = 
“ ‘0 n#£0. 


This sequence, illustrated in Figure 1, consists of all zeros except for a 
single one at n = 0. If the unit pulse sequence is passed through a moving 
average filter (whether finite or not), then the output is called the unit pulse 
response: 

Equation: 


hn = ye 9 knee 


= Wn- 


OF ao 


Unit Pulse Sequence 


(Note that 6,,_, = 0 unless n = k.) So the unit pulse sequence may be used 
to read out the weights of a moving average filter. It is common practice to 
use wy; (the k‘® weight) and h,, (the k‘" impulse response) interchangeably. 
Exercise: 


Problem: 


Find the unit pulse response for the finite moving average 


N-1 
= WkUn—k. Caution: You must consider 
k=0 
n<0,0<n<N-I,andn>N. 
Exercise: 
Problem: 


Find the unit pulse response for the recursive filter 
Ln = ALn-1 + WoUn. 


Unit Step Sequence. The unit step sequence is the sequence 
Equation: 


¢ 1, n=O 
Un = Sn = 
0, n<Q0. 


This sequence is illustrated in Figure 2. When this sequence is applied to a 
moving average filter, the result is the unit step response 
Equation: 


Gn = eo Wk 
= yr k=0 hr. 


The unit step response is just the sequence of partial sums of the unit pulse 
response. 


fé,vs n 


| e@ee#ee 


Unit Step Sequence 


Exercise: 


Problem: 


Find the unit step response for the finite moving average filter 
a 

= S- WkUn—k. Specialize your general result to the special case 
k=0 


1 
where wy, = — fork = 0,1,...,N —1. 
N 


Exercise: 


Problem: 


Find the unit step response for the recursive filter 
Cyn = ALynai + Wotkns 


Complex Exponential Sequence. The complex exponential sequence is the 
sequence 
Equation: 


u, =e, k= 0, +1, +£2,... 


This sequence, illustrated in Figure 3, is a “discrete-time phasor” that 
“ratchets” counterclockwise (CCW) as k moves to k + 1 and clockwise 
(CW) as k moves to k — 1. Each time the phasor ratchets, it turns out an 
angle of 8. Why should such a sequence be a useful test sequence? There 
are two reasons. 


0528 


Discrete-Time Phasor 


(i) e7*? represents (or codes) cos kO. The real part of the sequence e/*? is 
the cosinusoidal sequence cos ké: 
Equation: 


Re [e?*”] —cos ké. 


Therefore the discrete-time phasor e?*’ represents (or codes) cos k@ in the 
same way that the continuous-time phasor e/“” codes cos wt. If the moving 
average filter 

Equation: 


CO 
Ci y hpuUn—k 
k=0 


has real coefficients, we can get the response to a cosinusoidal sequence by 
taking the real part of the following sum: 
Equation: 


In = dpohe cos (n—k)d 
Re[ Spo hueK 9?) 
= Relea > he! |: 


In this formula, the sum 
Equation: 


CO 
s. hye I 
k=0 


is called the complex frequency response of the filter and is given the 
symbol 
Equation: 


H (e%”) = S- hye 2”, 
k=0 


This complex frequency response is just a complex number, with a 
magnitude |\H (e””) | and a phase arg H (e”). Therefore the output of the 
moving average filter is 

Equation: 


Re [ei H (e””)] 
Re ein? | H (e?”) | eJargH (*)] 
= \H (e””)| cos [nd+ arg H (e%”)|. 


Ln 


This remarkable result says that the output is also cosinusoidal, but its 
amplitude is |H (e”) | rather than 1, and its phase is argH (e””) rather than 
0. In the examples to follow, we will show that the complex “gain” H (ce?) 
can be highly selective in 0, meaning that cosines of some angular 
frequencies are passed with little attenuation while cosines of other 
frequencies are dramatically attenuated. By choosing the filter coefficients, 
we can design the frequency selectivity we would like to have. 


(ii) e*’ is a sampled data version of e4“*. The discrete-time phasor e/”? 
can be produced physically by sampling the continuous-time phasor e?% 
the periodic sampling instants t, = kT’: 

Equation: 


at 


kO _ jut — pjwkT 
eJkO — ej ag 
t=kT 


C= al. 


The dimensions of @ are radians, the dimensions of w are radians/second, 
and the dimensions of 7’ are seconds. We call T' the sampling interval and 


1 
Tr the sampling rate or sampling frequency. If the original angular 


2 
frequency of the phasor e/ is increased to w +m (=) , then the 


discrete-time phasor remains e/*? : 
Equation: 


elle tm(2n/T)t = ed(wkT +km2n) = elke 


t=kT 


This means that all continuous-time phasors of the form eJl#+™(27/T)it “hide 
under the same alias” when viewed through the sampling operation. That is, 
the sampled-data phasor cannot distinguish the frequency w from the 


20 
frequency w + m T In your subsequent courses you will study aliasing in 


more detail and study the Nyquist rule for sampling: 
Equation: 


pe 


2 
ee 
02 T 20 


1 
This rule says that you must sample signals at a rate (=) that exceeds the 


22 
bandwidth — of the signal. 
20 


Example: 

Let's pass the cosinusoidal sequence uz = cos k@ through the finite 
moving average filter 

Equation: 


We know from our previous result that the output is 
Equation: 


Cit |H (e%) | cos [nd+ arg H (e”)]. 


The complex frequency response for this example is 
Equation: 
0) N-1 1 .—jk6 
H(e*) = Vino we” 
1 1~e7IN? 


N 1-e-# 


(Do you see your old friend, the finite sum formula, at work?) Let's try to 
manipulate the result into a more elegant form: 
Equation: 


—j(N/2)8 [pi(N/2)0_ 9—5(N/2)0 
apo ey LU ESEEE e 
(e ) — N eH62) [ei0) eH) 


—sl(v—1)/219 sin 8) 


me 
= sin(46) 


The magnitude of the function H (ce?) is 
Equation: 


20 
At 86 = 0, corresponding to a “DC phasor,” H (e””) equals 1; at 0 = W 


\H (e”) = 0). The magnitude of the complex frequency response is 
plotted in Figure 4. 


20 
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Frequency Selectivity of a Moving Average Filter 


This result shows that the moving average filter is frequency selective, 
passing low frequencies with gain near 1 and high frequencies with gain 
near 0. 


Exercise: 


Problem: Compute the phase of the complex frequency response 
Equation: 


Exercise: 


Problem: 


1 
Choose the filter length NV for the filter hy, = W k=051,.3,1V-—1, 


so that a 60 Hz cosine, sampled at the rate 7 180, is perfectly 
zeroed out as it comes through the filter. 

Exercise: 
Problem: 
(MATLAB) Write a MATLAB program to compute and plot the 
magnitude |H (e””) | and the phase arg H (e%”) versus —7 <<O< 7 
when 
Equation: 


: : in (40 
H (e#) = ape aa ats 2 ) 
2 


Choose suitable increments for 0. 
Exercise: 
Problem: 


Compute the complex frequency response 1 (e?) for the recursive 


filter 7, = A®n_1 + WoUn. 


Filtering: Numerical Experiment (Frequency Response of First-Order 
Filter) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Consider the exponential moving average filter 


Equation: 
CO 
= a Un—k: a = 0. 98. 
k=0 
1. Write out a few terms of the sum to show how the filter works. 


2. Write x, aS a recursion and discuss the computer memory required to 
implement the filter. 

3. Compute the complex frequency response H e/” for the filter. 

4. Write a MATLAB program to plot the magnitude and phase of the 
complex frequency response H e/” versus @ for 9 = —xto + 7 in 


2 
steps of a Do this for two values of a, namely, a = 0. 98 and 


a = —0. 98. Explain your findings. 
5. Write a MATLAB program to pass the following signals through the 
filter when a = 0. 98: 


a ain = 0; 
Di ti = C4 
27 


CU; = £4608 $a" 


2 
Gti = 26-608 ai 


32 
27 
€&. Un = En COS —N 
16 
27 
f= £4 COS 3" 
27 


©. Ug = £008 ao" 
20 
hg 4645==-€45C08 a 
Plot the outputs for each case and interpret your findings in terms of the 


complex frequency response H e/° . Repeat step 5 fora = —0. 98. 
Interpret your findings. 


Binary Codes: Introduction 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Acknowledgment: Richard Hamming's book, Information Theory 
andCoding, Prentice-Hall, New York (1985) and C. T. Mullis's unpublished 
notes have influenced our treatment of binary codes. The numerical 
experiment was developed by Mullis. 


We use this chapter to introduce students to the communication paradigm 
and to show how arbitrary symbols may be represented by binary codes. 
These symbols and their corresponding binary codes may be computer 
instructions, integer data, approximations to real data, and so on. 


We develop some ad hoc tree codes for representing information and then 
develop Huffman codes for optimizing the use of bits. Hamming codes add 
check bits to a binary word so that errors may be detected and corrected. 
The numerical experiment has the students design a Huffman code for 
coding Lincoln's Gettysburg Address. 


Introduction 


It would be stretching our imagination to suggest that Sir Francis had digital 
audio on his minde (sic) when he wrote the prophetic words 


"Sir Francis Bacon, 1623 ...a man may express and signifie the intentions of 
his minde, at any distance... by... objects... capable of a twofold difference 
onely. " 


Nonetheless, this basic idea forms the basis of everything we do in digital 
computing, digital communications, and digital audio/video. In 1832, 
Samuel F. B. Morse used the very same idea to propose that telegram words 
be coded into binary addresses or binary codes that could be transmitted 
over telegraph lines and decoded at the receiving end to unravel the 
telegram. Morse abandoned his scheme, illustrated in Figure 1, as too 
complicated and, in 1838, proposed his fabled Morse code for coding letters 
(instead of words) into objects (dots, dashes, spaces) capable of a threefold 
difference onely (sic). 


word, W; 


word 
generator 
(inverse 

codebook) 


address 

generator 
(or 

codebook) 


address, 4; = 011010 


(associative memory) (memory) 
(coder) (decoder) 


Generalized Coder-Decoder 


The basic idea of Figure 1 is used today in cryptographic systems, where 
the “address a," is an encyphered version of a message w, ; in vector 
quantizers, where the “address a," is the address of a close approximation to 
data w; ; in coded satellite transmissions, where the “address a;" is a data 
word w; plus parity check bits for detecting and correcting errors; in digital 
audio systems, where the “address a," is a stretch of digitized and coded 
music; and in computer memories, where a; is an address (a coded version 
of a word of memory) and w; is a word in memory. 


In this chapter we study three fundamental questions in the construction of 
binary addresses or binary codes. First, what are plausible schemes for 
mapping symbols (such as words, letters, computer instructions, voltages, 
pressures, etc.) into binary codes? Second, what are plausible schemes for 
coding likely symbols with short binary words and unlikely symbols with 
long words in order to minimize the number of binary digits (bits) required 
to represent a message? Third, what are plausible schemes for “coding” 
binary words into longer binary words that contain “redundant bits” that 
may be used to detect and correct errors? These are not new questions. 
They have occupied the minds of many great thinkers. Sir Francis 
recognized that arbitrary messages had binary representations. Alan Turing, 
Alonzo Church, and Kurt Goedel studied binary codes for computations in 
their study of computable numbers and algorithms. Claude Shannon, R. C. 
Bose, Irving Reed, Richard Hamming, and many others have studied error 
control codes. Shan- non, David Huffman, and many others have studied 
the problem of efficiently coding information. 


In this chapter we outline the main ideas in binary coding and illustrate the 
role that binary coding plays in digital communications. In your subsequent 
courses in electrical and computer engineering you will study integrated 
circuits for building coders and decoders and mathematical models for 
designing good codes. 


Binary Codes: The Communication Paradigm 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


A paradigm is a pattern of ideas that form the foundation for a body of 
knowledge. A paradigm for (tele-) communication theory is a pattern of 
basic building blocks that may be applied to the dual problems of (i) 
reliably transmitting information from source to receiver at high speed or 
(ii) reliably storing information from source to memory at high density. 
High-speed communication permits us to accommodate many low-rate 
sources (such as audio) or one high-rate source (such as video). High- 
density storage permits us to store a large amount of information in a small 
space. For example, a typical 1.2 Mbyte floppy disc stores 9.6 x 10° bits of 
information, whereas a typical CD stores about 2 x 10° bits, enough for 


one hour's worth of high-quality sound. 
source channel 
(transmitter) 
(receiver) 
ee source channel 


Basic Building Blocks in a (Tele-) Communication 
System 


Figure 1 illustrates the basic building blocks that apply to any problem in 
the theory of (tele-) communication. The source is an arbitrary source of 
information. It can be the time-varying voltage at the output of a vibration 
sensor (such as an integrating accelerometer for measuring motion or a 
microphone for measuring sound pressure); it can be the charges stored in 
the CCD array of a solid-state camera; it can be the addresses generated 
from a sequence of keystrokes at a computer terminal; it can be a sequence 
of instructions in a computer program. The source coder is a device for 
turning primitive source outputs into more efficient representations. For 
example, in a recording studio, the source coder would convert analog 
voltages into digital approximations using an A/D converter; a fancy 
source coder would use a fancy A/D converter that finely quantized likely 
analog values and crudely quantized unlikely values. If the source is a 
source of discrete symbols like letters and numbers, then a fancy source 
code would assign short binary sequences to likely symbols (such as e) and 
long binary sequences to unlikely symbols (such as z). The channel coder 
adds “redundant bits” to the binary output of the source coder so that errors 
of transmission or storage may be detected and corrected. In the simplest 
example, a binary string of the form 01001001 would have an extra bit of 1 
added to give even parity (an even number of I's) to the string; the string 
10110111 would have an extra bit of 0 added to preserve the even parity. If 
one bit error is introduced in the channel, then the parity is odd and the 
receiver knows that an error has occurred. The modulator takes outputs of 
the channel coder, a stream of binary digits, and constructs an analog 
waveform that represents a block of bits. For example, in a 9600 baud 
Modem, five bits are used to determine one of 2° = 32 phases that are used 
to modulate the signal A cos (wt + wy). Each possible string of five bits has 
its own personalized phase, y, and this phase can be determined at the 
receiver. The signal A cos (wt + y) is an analog signal that may be 
transmitted over a channel (such as a telephone line, a microwave link, or a 
fiber-optic cable). The channel has a finite bandwidth, meaning that it 
distorts signals, and it is subject to noise or interference from other 
electromagnetic radiation. Therefore transmitted information arrives at the 
demodulator in imperfect form. The demodulator uses filters matched to the 
modulated signals to demodulate the phase and look up the corresponding 
bit stream. The channel decoder converts the coded bit stream into the 


information bit stream, and the source decoder looks up the corresponding 
symbol. This sequence of steps is illustrated symbolically in Figure 7.3. 


aj ——B 01 10 ——® 01 100— 4A cos(@t- [01 100]) 


aj <@#@— 01 10<— 01 100 <—- @ 


Symbolic Representation of Communication 


In your subsequent courses on communication theory you will study each 
block of Figure 1 in detail. You will find that every source of information 
has a characteristic complexity, called entropy, that determines the 
minimum rate at which bits must be generated in order to represent the 
source. You will also find that every communication channel has a 
characteristic tolerance for bits, called channel capacity. This capacity 
depends on signal-to-noise ratio and bandwidth. When the channel capacity 
exceeds the source entropy, then you can transmit information reliably; if it 
does not, then you cannot. 


Binary Codes: From Symbols to Binary Codes 


Note:This module is part of the collection, A First Course in Electrical and 
Computer Engineering. The LaTeX source files for this collection were 
created using an optical character recognition technology, and because of this 
process there may be more errors than usual. Please contact us if you 
discover any errors. 


Perhaps the most fundamental idea in communication theory is that arbitrary 
symbols may be represented by strings of binary digits. These strings are 
called binary words, binary addresses, or binary codes. In the simplest of 
cases, a finite alphabet consisting of the letters or symbols so, $1, ..., Syg_1 iS 
represented by binary codes. The obvious way to implement the 
representation is to let the i** binary code be the binary representation for the 
subscript 2: 

Equation: 


So ~ 000 = ag 


8S, ™ 001 = at 
S65 ™ 110 = a6 
87 ™ LE - = a7. 


The number of bits required for the binary code is NV where 
Equation: 


i | ea 


We say, roughly, that V =log, M. 


Octal Codes. When the number of symbols is large and the corresponding 
binary codes contain many bits, then we typically group the bits into groups 


of three and replace the binary code by its corresponding octal code. For 


example, a seven-bit binary code maps into a three-digit octal code as 
follows: 


Equation: 


0000000 ~ 000 
0000001 ~ O01 


0100110 ~ 046 
101111 ~ 137 


1111111 ~ 177. 


The octal ASCII codes for representing letters, numbers, and special 
characters are tabulated in ‘Table 1. 
Exercise: 


Problem: Write out the seven-bit ASCII codes for A, g, 7, and{. 


0 1 2 3 4 5 6 We 
00x Ny ty i 0, Ny a 
01x : " tr " f z s ‘1 
'0O2x mh "e ns "6, °c “a Sy iz 


' c E S E F G R U 
03x Ay m UB re s s s 


‘04x °p I ? # $ % & 


'05x ( ) = + ‘ - / 
‘06x 0 1 Z 3 4 5 6 7 
'07x 8 9 : < = > 3) 
'10x @ A B C D E F G 
'11x H I J K L M N O 
EDX P Q R S T U V W 
'13x xX Y Z [ \ ] a = 
'14x . a b C d e f g 
'15x h i j k l m n O 
'16x p q r S t u V Ww 
'17x x y Z { | } ~ PE 


Octal ASCII Codes (from Donald E. Knuth, The TEXbook, ©1986 by the 
American Mathematical Society, Providence, Rhode Island p. 367, published 
by Addison-Wesley Publishing Co.) 


Exercise: 
Problem: 
Add a 1 or a 0 to the most significant (left-most) position of the seven-bit 
ASCII code to produce an eight-bit code that has even parity (even 


number of 1's). Give the resulting eight-bit ASCII codes and the 
corresponding three-digit octal codes for %, u, f, 8, and +. 


Quantizers and A/D Converters. What if the source alphabet is infinite? 
Our only hope is to approximate it with a finite collection of finite binary 
words. For example, suppose the output of the source is an analog voltage 
that lies between — Vo and +Vo. We might break this peak-to-peak range up 


into little voltage cells of size wr and approximate the voltage in each cell 


by its midpoint. This scheme is illustrated in Figure 1. In the figure, the cell 


C’; is defined to be the set of voltages that fall between ie i . and 
V 
e”MA+—M?*: 
Equation: 
2Vv Vi 2V Vi 
O,=4V20 = eV 5 ee. 
M M M 


The mapping from continuous values of V to a finite set of approximations is 
Equation: 


Q(V) =i, if VE C;. 


’ ati GOV) — 
That is, V is replaced by the quantized approximation 24,’ whenever V lies in 


cell C’;. We may represent the quantized values ny 0 with binary codes by 
simply representing the subscript of the cell by a binary word. In a subsequent 
course on digital electronics and microprocessors you will study A/D 
(analog-to-digital) converters for quantizing variables. 


A Quantizer 


Example: 

If M = 8, corresponding to a three-bit quantizer, we may associate quantizer 
cells and quantized levels with binary codes as follows: 

Equation: 


VeC3 => V3=(-3) 0111 
VeC, > V2=(-2)2~110 
VeC, = Vi=(- 8 A0r 
VEQ = Vo = 0 ~ 000 

vet > W=(1)~001 
VEC > Ww=(2)7~010 
VeCQ; => V3=(3)~011. 


This particular code is called a sign-magnitude code, wherein the most 
significant bit is a sign bit and the remaining bits are magnitude bits (e.g., 
110 ~ —2 and 010 ~ 2). One of the defects of the sign-magnitude code is 
that it wastes one code by using 000 for 0 and 100 for-O. An alternative code 
that has many other advantages is the 2's complement code. The 2's 
complement codes for positive numbers are the same as the sign-magnitude 
codes, but the codes for negative numbers are generated by complementing 
all bits for the corresponding positive number and adding 1: 

Equation: 


—4~ 100 
—3~101 (100+1) 
—2~110 (101+1) 
=P 1) (10 1) 
0 ~ 000 

1~ 001 

2~ 010 

3~ 011. 


Exercise: 


Problem: 


Generate the four-bit sign-magnitude and four-bit 2's complement binary 
codes for the numbers —8, —7,..., —1,0,1, 2,..., 7. 


Exercise: 


Problem: 


Prove that, in the 2’s complement representation, the binary codes for 
—nand + n sum to zero. For example, 
Equation: 


101 + 011 = 000 
(—3) (3) (0). 


In your courses on computer arithmetic you will learn how to do arithmetic in 
various binary-coded systems. The following problem illustrates how easy 
arithmetic is in 2's complement. 

Exercise: 


Problem: 


Generate a table of sums for all 2's complement numbers between —4. 
and +3. Show that the sums are correct. Use 
0+0=0,0+1=1,1+0 =1, and1 +1 = 0 witha carry into the 
next bit. For example, 001 + 001 = 010. 


Binary Trees and Variable-Length Codes. The codes we have constructed 
so far are constant-length codes for finite alphabets that contain exactly 

M = 2% symbols. In the case where M = 8 and N = 3, then the eight 
possible three-bit codes may be represented as leaves on the branching tree 
illustrated in Figure 2(a). The tree grows a left branch for a 0 and a right 
branch for a 1, until it terminates after three branchings. The three-bit codes 
we have studied so far reside at the terminating leaves of the binary tree. But 
what if our source alphabet contains just five symbols or letters? We can 
represent these five symbols as the three-bit symbols 000 through 100 on the 
binary tree. This generates a constant-length code with three unused, or 


illegal, symbols 101 through 111. These are marked with an "x" in Figure 
2(a). These unused leaves and the branches leading to them may be pruned to 
produce the binary tree of Figure 2(b). 


If we admit variable-length codes, then we have several other options for 
using a binary tree to construct binary codes. Two of these codes and their 
corresponding binary trees are illustrated in Figure 3. If we disabuse ourselves 
of the notion that each code word must contain three or fewer bits, then we 
may construct binary trees like those of Figure 4 and generate their 
corresponding binary codes. In Figure 4(a), we grow a right branch after each 
left branch and label each leaf with a code word. In Figure 4(b), we prune off 
the last right branch and associatea code word with the leaf on the last left 
branch. 


000 001 010 onl 100 101 110 117 900 001 010 oll 100 
(So) (S;) (S82) (Ss) (54) (2) (#)~— (a) (S09) (S,) (S22) (S83) (Sq) 


Binary Trees and Constant-Length Codes; (a) Binary Tree, and (b) 
Pruned Binary Tree 


All of the codes we have generated so far are organized in Table 2. For each 
code, the average number of bits/symbol is tabulated. This average ranges 
from 2.4 to 3.0. If all symbols are equally likely to appear, then the best 
variable-length code would be code 2. 


000 001 010 oll 100 101 110 111 000 001 010 O11 100 
(So) (8;) (82) (83) (84) (@) (a) (az) (So) (S8:) (So) (S53) (S84) 


Binary Trees and Variable-Length Codes; (a) Binary Tree for Variable- 
length Code, and (b) Another Binary Tree for Variable-length Code 


All of the codes we have constructed have a common characteristic: each 
code word is a terminating leaf on a binary tree, meaning that no code word 
lies along a limb of branches to another code word. We say that no code word 
is a prefix to another code word. This property makes each of the codes 
instantaneously decodable, meaning that each bit in a string of bits may be 
processed instantaneously (or independently) without dependence on 
subsequent bits. 

Exercise: 


Problem: Decode the following sequence of bits using code 2: 
Equation: 


0111001111000000101100111. 


Left-Handed Binary Trees for Variable-Length Codes; (a) Left-handed 
Binary Tree, and (b) Pruned Binary Tree 


Code Average 


# So S S» S3 S4 Bits/Symbol 
1 000 001 010 011 100 15/5 = 3.0 
2 000 001 01 10 11 12/5 = 2.4 
3 000 001 010 011 1 13/5 = 2.6 
4 1 01 001 0001 00001 15/5 = 3.0 
5 1 01 001 0001 0000 14/5 = 2.8 
Variable Length Codes 
Exercise: 
Problem: 


Illustrate the following codes on a binary tree. Which of them are 
instantaneously decodable? Which can be pruned and remain 
instantaneously decodable? 

Equation: 


So Si Soe S3 Sa 
011 100 OO 11 #101 
011 100 00 0 O01 
010 000 100 101 111. 


Code #2 generated in Table 2 seems like a better code than code #5 because 
its average number of bits/symbol (2.4) is smaller. But what if symbol So is a 
very likely symbol and symbol S44 is a very unlikely one? Then it may well 
turn out that the average number of bits used by code #5 is less than the 
average number used by code #2. So what is the best code? The answer 
depends on the relative frequency of use for each symbol. We explore this 
question in the next section. 


Binary Codes: Huffman Codes for Source Coding 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


In 1838, Samuel Morse was struggling with the problem of designing an 
efficient code for transmitting information over telegraph lines. He reasoned 
that an efficient code would use short code words for common letters and 
long code words for uncommon letters. (Can you see the profit motive at 
work?) In order to turn this reasoned principle into workable practice, 
Morse rummaged around in the composition trays for typeface in a 
printshop. He discovered that typesetters use many more e’s than s's. He 
then formed a table that showed the relative frequency with which each 
letter was used. His ingenious, variable-length Morse code assigned short 
codes to likely letters (like “dot” for e’) and long codes to unlikely letters 
(like “dash dash dot dot” for z’). We now know that Morse came within 
about 15% of the theoretical minimum for the average code word length for 
English language text. 


A Variation on Morse's Experiment. In order to set the stage for our study 
of efficient source codes, let's run a variation on Morse's experiment to see 
if we can independently arrive at a way of designing codes. Instead of 
giving ourselves a composition tray, let's start with a communication source 
that generates five symbols or letters So, $1, S2,.53, $4. We run the source 
for 100 transmissions and observe the following numbers of transmissions 
for each symbol: 

Equation: 


50 Sos 
20 Sis 
2055s 
5 Sis 
5 Sis. 


We will assume that these “source statistics” are typical, meaning that 1000 
transmissions would yield 500 ${s and so on. 


The most primitive binary code we could build for our source would use 
three bits for each symbol: 
Equation: 


So ~ 000 
S; ~ 001 
S2 ~ 010 
S3 ~ 011 
S4 ~ 100 
x ~ 101 
x ~ 110 
z~ 111. 


This code is inefficient in two ways. First, it leaves three illegal code words 
that correspond to no source symbol. Second, it uses the same code word 
length for an unlikely symbol (like $4) that it uses for a likely symbol (like 
So). The first defect we can correct by concatenating consecutive symbols 
into symbol blocks, or composite symbols. If we form a composite symbol 
consisting of M/Z source symbols, then a typical composite symbol is 

$1 S$095194525351S2509. The number of such composite symbols that can 
be generated is 5“. The binary code for these 5 composite symbols must 
contain N binary digits where 

Equation: 


ON) 25 29" (N &M log, 5). 


The number of bits per source symbol is 
Equation: 


N 
1) ee 
uM O89 


This scheme improves on the best variable length code of Table 2 from 
"Binary Codes: From Symbols to Binary Codes" by 0.08 bits/symbol. 
Exercise: 


Problem: 
Suppose your source of information generates the 26 lowercase roman 
letters used in English language text. These letters are to be 


concatenated into blocks of length M. Complete the following table of 
N (number of bits) versus M (number of letters in a block) and show 


N 
that —— approaches log. 26. 
M Pp 0) 


M 
1 2 3 4 fs) 6 
N fs) 10 


Now let's reason, as Morse did, that an efficient code would use short codes 
for likely symbols and long codes for unlikely symbols. Let's pick code #5 
from Table 2 from "Binary Codes: From Symbols to Binary Codes" for this 
purpose: 

Equation: 


So Si S» S3 S4 
1 O01 001 0001 OO0O0OD. 


This is a variable-length code. If we use this code on the 100 symbols that 
generated our source statistic, the average number of bits/symbol is 
Equation: 


ay (0 (1) + 20 (2) + 20 (3) +5 (4) + 5 (4)] = 1. 90bits /symbol. 


Exercise: 
Problem: 
Use the source statistics of Equation 1 to determine the average 


number of bits/symbol for each code in Table 2 from "Binary Codes: 
From Symbols to Binary Codes". 


Entropy. So far, each ad hoc scheme we have tried has produced an 
improvement in the average number of bits/symbol. How far can this go? 
The answer is given by Shannon's source coding theorem, which says that 
the minimum number of bits/symbol is 

Equation: 


M 


N 
Via - Di logs pi 


where p; is the probability that symbol S; is generated and — pi logs Di 
is a fundamental property of the source called entropy. For our five-symbol 


example, the table of p; and — log p; is given in Table 2. The entropy is 


1.861, and the bound on bits/symbol is 
Equation: 


> 1.861. 


Code #5 comes within 0.039 of this lower bound. As we will see in the 
next paragraphs, this is as close as we can come without coding composite 


symbols. 
Symbol Probability 
So 0.5 
Si 0.2 
So 02 
S3 0.05 
S4 0.05 


Source statistics for Five-Symbol Source 


Exercise: 


~ Log Probability 


4.32 


4.32 


Problem: 


Select an arbitrary page of English text. Build a table of source 
Statistics containing p, (relative frequencies) and-log p; for a through z 
. (Ignore distinction between upper and lower case and ignore 
punctuation and other special symbols.) Compute the entropy 

26 
= Di 10G5-p;. 

i=1 


Huffman Codes. In the late 1950s, David Huffman discovered an 
algorithm for designing variable-length codes that minimize the average 
number of bits/symbol. Huffman's algorithm uses a principle of optimality 
that says, “the optimal code for M letters has imbedded in it the optimal 
code for the M — 1 letters that result from aggregating the two least likely 
symbols.” When this principle is iterated, then we have an algorithm for 
generating the binary tree for a Huffman code: 


i. label all symbols as “children”; 
ii. “twin” the two least probable children and give the twin the sum of the 


probabilities: 
ta 
(54) (53) 
(0.05) (9:03) 


ili. (regard the twin as a child; and 
iv. repeat steps (ii) and (iii) until all children are accounted for. 


This tree is now labeled with 1's and 0's to obtain the Huffman code. The 
labeling procedure is to label each right branch with a 1 and each left 
branch with a 0. The procedure for laying out symbols and constructing 
Huffman trees and codes is illustrated in the following examples. 


Example: 
Consider the source statistics 
Equation: 


Symbol So S1 So S3 ot 
Probability 0.5 0.2 0.2 0.05 0.05 


for which the Huffman algorithm produces the following binary tree and its 
corresponding code: 


Example: 
The Huffman code for the source statistics 
Equation: 


Symbol So S1 So S3 S4 
Probability 0.75 0.075 0.075 0.05 0.05 


is illustrated next: 


(S4) 0.05 0.075 (S,) 
000 me). S59 O1l 
001 010 
Exercise: 
Problem: 


Generate binary trees and Huffman codes for the following source 
Statistics: 
Equation: 


Symbol So S S» S3 S4 S5 S6 S7 
Probability] 0.20 0.20 0.15 0.15 0.1 0.1 0.05 0.05 
Probability2 0.3 0.25 0.1 O.1 0.075 0.075 0.05 0.05. 


Coding a FAX Machine. Symbols can arise in unusual ways and be 
defined quite arbitrarily. To illustrate this point, we consider the design of a 


hypothetical FAX machine. For our design we will assume that a laser 
scanner reads a page of black and white text or pictures, producing a high 
voltage for a black spot and a low voltage for a white spot. We will also 
assume that the laser scanner resolves the page at 1024 lines, with 1024 
spots/line. This means that each page is represented by a two-dimensional 
array, or matrix, of pixels (picture elements), each pixel being 1 or 0. If we 
simply transmitted these l's and O’s, then we would need 

1024 x 1024 = 1, 059, 576 bits. If these were transmitted over a 9600 
baud phone line, then it would take almost 2 minutes to transmit the FAX. 
This is a long time. 


Let's think about a typical scan line for a printed page. It will contain long 
runs of 0's, corresponding to long runs of white, interrupted by short bursts 
of I's, corresponding to short runs of black where the scanner encounters a 
line or a part of a letter. So why not try to define a symbol to be “a run of k 
0's" and code these runs? The resulting code is called a “run length code.” 
Let's define eight symbols, corresponding to run lengths from 0 to 7 (a run 
length of 0 is a 1): 

Equation: 


So = run length of 0 zeros (a 1) 
S.- = run length of 1 zero 


S7 = run legnth of 7 zeros. 


If we simply used a simple three-bit binary code for these eight symbols, 
then for each scan line we would generate anywhere from 3 x 1024 bits 
(for a scan line consisting of all 1's) to 3 x 1024/7 = 400 bits (for a scan 
line consisting of all 0's). But what if we ran an experiment to determine the 
relative frequency of the run lengths Sp through S7 and used a Huffman 
code to “run length encode” the run lengths? The following problem 
explores this possibility and produces an efficient FAX code. 

Exercise: 


Problem: 


An experiment conducted on FAXed documents produces the 
following statistics for run lengths of white ranging from 0 to 7: 
Equation: 


Symbol So S1 S» S3 S4 Ss S6 S7 
Probability 0.01 0.06 0.1 0.1 0.2 0.15 0.15 0.2 


These statistics indicate that only 1% of a typical page is black. 
Construct the Huffman code for this source. Use your Huffman code to 
code and decode these scan lines: 


[Kon © tt eee es See 


Binary Codes: Hamming Codes for Channel Coding 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The idea behind Hamming codes is to intersperse, or append, extra binary 
digits to a binary code so that errors in transmission of the code over a 
channel may be detected and corrected. For example, suppose we transmit 
the code 01101001, and it is received as 01001001. In this transmission, the 
third most significant bit is received erroneously. Let's define the following 
“modulo-2 addition” of binary numbers: 


Equation: 
0¢0=0 
0@G1=1 
160=1 
161=0. 


Multiplication in modulo-2 arithmetic is simply 0-0 = 0-1=1-0=0 
and 1 -1 = 1. Then we can say that the error sequence 00100000 is “added” 
to the transmission 01101001 to produce the erroneous reception: 
Equation: 


01101001 transmitted 
@® 00100000 error 
01001001 received. 


Hamming error correcting codes will permit us to receive the erroneous 
transmission and to detect and correct the error. This is obviously of great 
value in transmitting and storing information. (Imagine how upset you 
would be to have the binary code for your checking account confused with 
that of Mrs. Joan Kroc.) 


Choosing the Number of Check Bits. Let's suppose we have N bits of 
information that we wish to transmit and that we wish to intersperse “check 
bits” that will enable us to detect and correct any single bit error in the 
transmission. If we use NV information bits and n check bits, then we will 
transmit a code word containing N + n bits. The n check bits can code 2” 
events, and we want these events to indicate whether or not any errors 
occurred and, if so, where they occurred. Therefore we require 


where (V + 7) is the number of single error events that can occur and +1 is 
the number of no-error events. For example, when NV = 4, we require 
n = 3s0 that 2? > (44+ 3) +1. 


How many check bits do you require to code seven bits of information for 
single error correction? 


Code Construction. Let's suppose we have constructed an (V, n) 
Hamming code consisting of NV information bits and n check bits (or parity 
bits). We denote the information bits by x1, £2, ..., zy and the check bits by 
C1, C2, ---) Cn. These bits may be interspersed. When NV = 4 and n = 3, then 
a typical array of bits within a code word would be one of the following: 
Equation: 


Ci LI 
C2 T2 
LY L3 


C3 or LA 
2 C1 
L3 C2 
LA C3 


The first ordering is “natural” (as we will see), and the second is 
“systematic” (a term that is used to describe any code whose head is 
information and whose tail is check). If a single error occurs in an (NV, n) 
code, then the received code word will be the modulo-2 sum of the code 
word and the error word that contains a 1 in its i** position: 

Equation: 


Ci 
C2 
ZL) 


L2 
L3 


ey 
w 
D 

eo Ns > A = a = 


L4 


We would like to operate on this received code word in such a way that the 
location of the error bit can be determined. If there were no code word, then 
an obvious solution would be to premultiply the error word by the parity 
check matrix 

Equation: 


The i** column of A” is just the binary code for 7. When A? premultiplies 
an error word, the error bit picks out the column that codes the error 
position: 


103103101 0 
ott 0 0 1 1 0 
0001111 1 

position | 


binary code for 


0 

0 

0 
fae f= 

0 

0 position 4 

0 


If the error word contains no error bits, then the product is 0, indicating no 
eIrors. 


This seems like a good idea, but what about the effect of the code word? In 
Exercise 1, you are asked to show that the effect of the parity check matrix 
A? applied to the modulo-2 sum of a code word x and an error word e is 
Equation: 


A’ (x@e) = A’x@ A’e. 


In this equation all sums and products obey the rules of modulo-2 
arithmetic. 
Exercise: 


Problem: 
Let y = x @ e denote the modulo-2 sum of a code word x and an error 
word e; A? is a parity check matrix. Show that 


Equation: 


Aty = A’x6 Ale. 


We have designed the parity check matrix A” so that the syndromeATe 
produces a binary code for the error location. (The location of the error is 


th’e syndrome for the error word.) The product A?x will interfere with this 
syndrome unless A? z = 0. Therefore we will require that the code word 
satisfy the constraint 

Equation: 


A’x =0. 


This constraint actually defines the Hamming code. Let's illustrate this point 
by applying the constraint to a code word in its “natural format” x? = 
(CyCoX 103% 2X34). 


Natural Codes. When the information bits and the check bits are coded in 
their natural order (c1c2%1¢3%2%324), then we may determine the check bits 
by writing Ax as follows: 

Equation: 


Ci 
C2 
L1 


C3 = 


oe 
oro 
Orr 
Ke oO 
eH Or 
ee © 
SS eS 
Se: OS 


sy) 
3 
D4 


We use the rules of modulo-2 arithmetic to write these constraints as 
Equation: 


C1027, 0%2087%,=0 
CoB 2102730 274 = 0 
c3 8B X2O730 7%, = O. 


Therefore the check bits c1, cz, and c3 are simply the following modulo-2 
sums 


Equation: 


Cy = 7% OX20 44 
C2 = %1 OxX3 0 4 
C3 = Fa PMO L4. 


This finding may be organized into the matrix equation 


Equation: 


C1 
C2 
L) 
C3 
L2 
L3 


L4 


This equation shows how the code word z is built from the information bits 
(21, £2, £3, £4). We call the matrix that defines the construction a coder 


matrix and write it as H: 
Equation: 


Equation: 


1101 
1011 
1000 
O111 
0100 
0010 
0001 


xz — HO 


xP = (C1C9X 103222324) et = (21220324) 


L1 
v2 
v3 
4 


1101 
1011 
1000 
H= 0111 
0100 
0010 
0001 


This summarizes the construction of a Hamming code z. 
Exercise: 


Problem: 
Check to see that the product of the parity check matrix A? and the 
coder matrix H is A7H = 0. Interpret this result. 
Exercise: 
Problem: 


Fill in the following table to show what the Hamming (4, 3) code is: 
Equation: 


8 
pa 
8 
i) 
8 
i) 


LZ4 Cy Co Ly C3 LQ LZ LJ 
0 0 0 0 0 0 0 =O 


0 0 O 
0 0 0 1 1 1 0 1 0 0 =41 
0 0 1 O 
0 O 1 1 
0 1 0 O 
0 1 0 1 
0 1 1 O 
0 1 1 1 
1 0 0 O 
1 0 0 1 
1 0 1 O 
1 O 1 1 
1 1 0 0 
1 1 0 1 
1 1 1 =O 
1 1 1 #1 

Exercise: 

Problem: 


Design a Hamming (11, 7) code for coding eleven information bits 
against single errors. Show your equations for cj, C2, ..., Gn and write 
out the coder matrix H for x = HO. 


Decoding. To decode a Hamming code, we form the syndrome A‘y for the 
received (and possibly erroneous) code word y = x @ e. Because A? x = 0, 
the syndrome is 

Equation: 


s— Ate. 


Convert this binary number into its corresponding integer location and 
change the bit of y in that location. If the location is zero, do nothing. Now 
strip off the information bits. This is the decoding algorithm. 

Exercise: 


Problem: 


Use the table of Hamming (4,3) codes from Exercise 3 to construct a 
table of received codes that contain either no bit errors or exactly one 
bit error. Apply the decoding algorithm to construct (21, 2, £3, 24) 
and show that all received code words with one or fewer errors are 
correctly decoded. 


Digital Hardware. The tables you have constructed in Exercise 3 and 5 for 
coding and decoding Hamming (4, 3) codes may be stored in digital logic 
chips. Their functionality is illustrated in Figure 1. The coder chip accepts 
(21222324) as its address and generates a coded word. The decoder chip 
accepts (C1C2%1C3%2%324) as its address and generates a decoded word. In 


your courses on digital logic you will study circuits for implementing 
coders and decoders. 


DL Cy Cc, zy 
C2 
Xo LZ, 2X L2 
Coder Cz Cz Decoder 
oe Lz 22 Ls 
Lz Ws 
L4 Le % Va 


Digital Logic for Hamming Code 


Exercise: 
Problem: 
Discuss the possibility of detecting a received (4, 3) code word that is 


neither a valid code word nor a code word with a single error. How 
would you use such a detector? 


Exercise: 


Problem: 


What fraction of received seven-bit words can be correctly decoded as 
Hamming (4, 3) codes? 


Systematic Codes. Systematic Hamming codes are codes whose 
information bits lead and whose check bits trail. The format for a (4, 3) 
code is then (#122%324C1C2C3). The construction of a (4,3) code word from 
the information bits may be written as 


Equation: 


Ty 
TQ 
L3 
L4 
Cy 
Co 
C3 


The coder matrix takes the form 


Equation: 


oO F & 


LI 
X92 
L3 


L4 


The problem is to find the matrix C that defines the construction of check 
bits. The constraint A7x = 0 produces the constraint A7H = 0 so that 
AHO = 0. The constraints A7H = 0 may be written out as 

Equation: 


1 0O O 0 

0 1 0 O 
10101041 0 0 1 O 00 0 0 
0110011 = 0 0 0 1 2+ 0000. 
000%1t1i11 Ci Cyn «C13 «C44 0 0 0 0 


C21 C22 C23 Co4 
C31 C32 C33 C34 


These constraints produce all the equations we need (twelve equations in 
twelve unknowns) to determine the c;,;. 
Exercise: 


Problem: 


Solve Equation 19 for the c;;. Show that the coder matrix for a 
systematic Hamming (4, 3) code is 


Equation: 
1 0 0 0 
010 0 
00 1 0 
H= 0001 
0111 
1011 
1101 


Exercise: 


Problem: 


Show that the coder matrix of Exercise 7 is a permutation of the coder 
matrix in Equation 14. (That is, the rows are reordered.) 


Exercise: 


Problem: 


(MATLAB) Write a MATLAB program that builds Hamming (4, 3) 
codes from information bits (21222324) and decodes Hamming (4, 3) 
codes (c1c2%1¢3%2%324) to obtain information bits (#1727324). 
Synthesize all seven-bit binary codes and show that your decoder 
correctly decodes correct codes and one-bit error codes. 


Binary Codes: Numerical Experiment (Huffman Codes) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


The paragraph below contains Lincoln's Gettysburg Address. Ignore special 
symbols like periods and ignore the distinction between lowercase and 
uppercase letters. Compute the relative frequency of occurrence for each of 
the 26 letters of the alphabet. Write a MATLAB program to generate a 
binary tree for the Huffman code of the Gettysburg Address. When you 
have generated the code, compute the average code word length 

Equation: 


where a is the relative frequency for symbol z and /; is the code word 
length for symbol 7. Compare L with the entropy 


Equation: 
26 te 26 
d, _ N N a Nj 


and compare L to log, 26. Interpret your findings. 


Gettysburg Address 


Fourscore and seven years ago, our fathers brought 
forth upon this 

continent a new nation, conceived in liberty and 
dedicated to the proposition 

that all men are created equal. Now we are engaged 
in a great civil war, 

testing whether that nation or any nation so 
conceived and so dedicated can 

long endure. We are met on a great battlefield of 
that war. We have come 

to dedicate a portion of that field as a final 
resting-place for those who have 

given their lives that that nation might live. It 
is altogether fitting and proper 

that we should do this. But in a larger sense, we 
cannot dedicate, we cannot 

consecrate, we cannot hallow this ground. The 
brave men, living and dead, 

who struggled here have consecrated it far above 
Our power to add or detract. 

The world will little note nor long remember what 
we say here, but it can 

never forget what they did here. It is for us the 
living rather to be dedicated 

here to the unfinished work which they who fought 
here have thus far so nobly 

advanced. It is rather for us to be here dedicated 
to the great task remaining 

before us-that from these honored dead we take 
increased devotion to that 

cause for which they gave the last full measure of 
devotion-that we here 

highly resolve that these dead should not have 
died in vain, that this nation 

under God shall have a new birth of freedom, and 
that government of the 


people, by the people, for the people shall not 
perish from the Earth. 
-Abraham Lincoln, November 19, 1863 


An Introduction to MATLAB: Introduction 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


Acknowledgment: This appendix, written with assistance from Cédric J. 
Demeure and Peter Massey, was inspired by the MATLAB user's manual 
from The MATHWORKS, Inc. The MA TLAB Primer, available through 
the MATLAB User's Group, is a useful learning aid for teachers and 
students. To join the MATLAB User's Group, send your request via E-mail 
to matlab-users request@mcs.anl.gov. 


Introduction 


MATLAB stands for “Matrix Laboratory.” It is a computing environment 
specifically designed for matrix computations. The program is ideally 
suited to circuit analysis, signal processing, filter design, control system 
analysis, and much more. Beyond that, its versatility with complex numbers 
and graphics makes it an attractive choice for many other programming 
tasks. MATLAB can be thought of as a programming language like 
PASCAL, FORTRAN, C, or BASIC. Like most versions of BASIC, 
MATLAB can be used in an interactive mode wherein statements are 
executed immediately as they are typed. Alternatively, a program can be 
written in advance and saved to a disc file using an editor and then executed 
in MATLAB. You will find both modes of operation useful. 


An Introduction to MATLAB: Running MATLAB (Macintosh) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


In order to run MATLAB on a Macintosh SE or PLUS computer, you need 
the program called EDU-MATLAB. The program requires at least 1 Mbyte 
of memory, System 3.0 or above, Finder version 3.0 or above, and an 800K 
drive. A hard disc drive is highly recommended. In order to run MATLAB 

ona Macintosh II, IIx, IIcx, or SE/30, you need the program called MaclII- 
MATLAB, and the same system requirements apply. 


To start MATLAB, you may need to open the folder containing the 
MATLAB program. Then just “double-click” the program icon or the 
program name (for example, EDU-MATLAB). Figure A.1 shows a typical 
organization of the folder containing Mac II-MATLAB. It contains the main 
program, the settings file, the demonstrations folder, and any toolbox 
folders. The double-click on Mac II-MATLAB produces the Command 
window as shown in Figure A.2. You will also see a Graph window 
partially hidden behind it. (The fact that the window is not in ffont means 
that it is opened but not currently active.) If you do not know what 
“clicking,” “dragging,” “pop-up menu,” and “trash” mean, you should stop 
reading now and familiarize yourself with the Macintosh. 
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The MATLAB Folder 
(©Apple Computer, Inc., 
used with permission.) 


In the command window, you should see the prompt >>. The program 
interpreter is waiting for you to enter instructions. At this point it is a good 
idea to run the demonstration programs that are available in the “About 
MATLAB” menu under the Apple menu. Just click on the “demos” button 
and select a demo. During pauses, strike any key to continue. Whenever 
you have a MATLAB file in any folder, then you may double-click the file 
to launch the program. This allows you to have your own folder containing 
your own MATLAB files, separated from the MATLAB folder. 
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The Command Window 
(©Apple Computer, Inc., 
used with permission. ) 


MATLAB has four types of windows: 


i. Command for computing, programming, and designing input/output 
displays; 
ii. Graph for displaying plots and graphs; 
iii. Edit for creating and modifying your own files; and 
iv. Help for getting on-line help and for running demos. 


All windows follow the traditional behavior of Macintosh windows. You 
can resize them (actually the help window has a fixed size) or move them. 
For more details on menus and windows, see the Macintosh and MATLAB 
manuals. 


An Introduction to MATLAB: Running MATLAB (PC) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


In order to run MATLAB (Version 3.5) on an IBM or compatible personal 
computer, you must have a floating point math coprocessor 

installed and at least 512 kbytes of memory. The program is called PCMAT- 
LAB.EXE, but it is usually invoked via the batch file MATLAB.BAT in the 
MATLAB subdirectory. If you are using a menu system and MATLAB is 
one of your choices, just choose it. Otherwise, go to the MATLAB 
suiUirectory and type MATLAB. 


You may be able to usea more powerful implementation of MATLAB if you 
have an 80286 or 80386 machine. AT-MATLAB runs on an 80286 with at 
least 1 Mbyte of extended memory. AI-MATLAB is distributed with PC- 
MATLAB. 386-MATLAB, a special version for 80386 or 80486 machines 
with virtual memory support and no limits on variable size, is sold 
separately. 


When you run MATLAB, you should see the prompt. The program 
interpreter is waiting for you to enter instructions. Some MATLAB 
instructions, such as plot, are graphics-type instructions which plot results 
and data. Execution of one of these graphics instructions puts the PC screen 
into the graphics mode, which displays the resulting plot. No instructions 
can be executed in the graphics mode other than a screen-dump function. 
Striking any other key will return the PC to the command mode, but the 
graphics are temporarily stored (like variables) and can be recalled by the 
shg (show graphics) instruction. If you wish, you may run some of the 
demonstration programs now by entering demo and following the on-screen 
instructions. 
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In command mode, MATLAB displays a prompt and waits for your 
input. You may type any legal mathematical expression for immediate 
evaluation. Try the following three examples (press “enter” or “return” at 
the end of each line): 


> 24+2 
> 5A2 
> 2*sin(pi/4) 


The variable p1 = 3.14 is built into MATLAB, as are the Sin function 
and hundreds of other functions. When you entered each of the preceding 
lines, MATLAB stored the results in a variable called ans for answer. The 
value of ans was then displayed. The last line should have produced the 
square root of 2. We can manipulate ans to find out 


> ansA2 


The new answer is very close to 2, as expected. Let's see what the roundoff 
error is: 


> ans-2 
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Any result you wish to keep for a while may be assigned to a variable other 
than ans: 


> X = pi/7 


> cos(x) 
> y = sin(x)42+cos(x)2; 
> y 


A semicolon (;) at the end of the line suppresses printing of the result, as 
when we calculated y in the next-to-last line just shown. This feature is 
especially useful when writing MATLAB programs where intermediate 
results are not of interest and when working with large matrices. 


MATLAB supports the dynamic creation of variables. You can create your 
own variables by just assigning a value to a variable. For example, type X 

= 3.5+4.2. Then the real variable X contains the value 7 . 7. Variable 
names must start with an alphabetical character and be less than nineteen 
characters long. If you type X = -3%*4.0, the content 7.7 is replaced by 
the value -12. Some commands allow you to keep track of all the variables 
that you have already created in your session. Type who or whos to get the 
list and names of the variables currently in memory (whOS gives more 
information than who). To clear all the variables, type in clear. To clear a 
single variable (or several) from the list, follow the command clear by 
the name of the variable you want to delete or by a list of variable names 
separated by spaces. Try it now. 


MATLAB is case sensitive. In other words, z and X are two different 
variables. You can control the case sensitivity of MATLAB by entering the 
command casesen, which toggles the sensitivity. The command 
casesen on enforces case sensitivity, and casesen off cancels it. 


If one line is not enough to enter your command, then finish the first line 
with two dots (.. ) and continue on the next line. You can enter more than 
one command per line by separating them with commas if you want the 
result displayed or with semicolons if you do not want the result displayed. 
For example, type 


> theta = pi/7; x = cos(theta); y = sin(theta); 
> x,y 


to first compute theta,cos( theta), and Sin( theta) and then to 
print x and y. 
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The number »/—1 is predefined in MATLAB and stored in the two variable 
locations denoted by i and j. This double definition comes from the 
preference of mathematicians for using 7 and the preference of engineers for 
using 7 (with z denoting electrical current). 1 and j are variables, and their 
contents may be changed. If you type j = 5, then this is the value for j 
and j no longer contains \/—1. Type in j} = sqrt(-1) to restore the 
original value. Note the way a complex variable is displayed. If you type 1, 
you should get the answer 


1.= 
0+1.00001. 


The same value will be displayed for J. Try it. Using j, you can now enter 
complex variables. For example, enter Z1 = 1+2*j andz2 = 
2+1.5%*j. As j is a variable, you have to use the multiplication sign *. 
Otherwise, you will get an error message. MATLAB does not differentiate 
(except in storage) between a real and a complex variable. Therefore 
variables may be added, subtracted, multiplied, or even divided. For 
example, typeinxX = 2, Z = 4.5*j, and Z/x. The real and 
imaginary parts of Z are both divided by x. MATLAB just treats the real 
variable X as a complex variable with a zero imaginary part. A complex 
variable that happens to have a zero imaginary part is treated like a real 
variable. Subtract 2* j from Z1 and display the result. 


MATLAB contains several built-in functions to manipulate complex 
numbers. For example, real (Z) extracts the real part of the complex 
number Z. Type 


> Z = 24+1.5*j, real(z) 
to get the result 


Ze= 
2.000+1.5001 


ans = 
2 


Similarly, imag(Z) extracts the imaginary part of the complex number Z. 
The functions abS(Z) and angle(z) compute the absolute value 
(magnitude) of the complex number Z and its angle (in radians). For 
example, type 


> Z = 2+2*7j; 

> r = abs(z) 

> theta = angle(z) 
> Z = r*exp(j*theta) 


The last command shows how to get back the original complex number 
from its magnitude and angle. This is clarified in Chapter 1: Complex 
Numbers. 


Another useful function, conj (Z), returns the complex conjugate of the 
complex number Z. If Z = X+]*y where X and y are real, then conj 
(Z) is equal to X-]*y. Verify this for several complex numbers by using 
the function conj (Z). 
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As its name indicates, MATLAB is especially designed to handle matrices. 
The simplest way to enter a matrix is to use an explicit list. In the list, the 
elements are separated by blanks or commas, and the semicolon (;) is used 
to indicate the end of a row. The list is surrounded by square brackets [| ]. 
For example, the statement 


> A= [1 2 3;4 5 6;7 8 Q] 


results in the output 


The variable A is a matrix of size 3 x 3. Matrix elements can be any 
MATLAB expression. For example, the command 


> x = [-1.3 sqrt(3) (1+2+3)*4/5] 
results in the matrix 


xX = 
-1.3000 1.7321 4.8000 


We call a matrix with just one row or one column a vector, andal x 1 
matrix is a scalar. Individual matrix elements can be referenced with indices 
that are placed inside parentheses. Type x(5) = abs(x(1)) to produce 
the new vector 


X = 
-1.3000 1.7321 4.8000 0.000 1.3000 


Note that the size of x has been automatically adjusted to accommodate the 
new element and that elements not referenced are set equal to 0 (here x(4)). 
New rows or columns can be added very easily. Try typing r = [10 11 
12],A = [A;1r]. Dimensions in the command must coincide. Try r = 
[13 14],A = [A;r]. 


The command size(A) gives the number of rows and the number of 
columns of A. The output from $1ze(A) is itself a matrix of size 1 x 2. 
These numbers can be stored if necessary by the command [m n] = 
$ize(A). In our previous example, A = [A;r]isa4 x 3 matrix, so the 
variable m will contain the number 4 and n will contain the number 3. A 
vector is a matrix for which either m or nN is equal to 1. If m is equal to 1, the 
matrix is a row vector; if N is equal to 1, the matrix is a column vector. 
Matrices and vectors may contain complex numbers. For example, the 
statement 


> A= [1 2;3 4]+]*[5 6;7 8] 

and the statement 

>> A = [1+5*j 2+6*j5;3+7*j 4+8*j] 

are equivalent, and they both produce the matrix 

A = 
1.0000+5.00001 2.0000+6.00001 
3.0000+7 .00001 4.0000+8 .00001 


Note that blanks must be avoided in the second expression for A. Try typing 


>A = [1 + 5*jJ 2 + 6*j 2 + 6*5;3 +7*j3 4 + 8*7F] 
What is the size of A now? 


MATLAB has several built-in functions to manipulate matrices. The special 
character, ', for prime denotes the transpose of a matrix. The statement A = 
[ 1 2 3;4 5 6;7 8 9]' produces the matrix 


The rows of A' are the column of A, and vice versa. If A is a complex 
matrix, then A is its complex conjugate transpose or hermitian transpose. 
For an “unconjugate” transpose, use the two-character operator dot-prime (. 
'). Matrix and vector variables can be added, subtracted, and multiplied as 
regular variables if the sizes match. Only matrices of the same size can be 
added or subtracted. There is, however, an easy way to add or subtract a 
common scalar from each element of a matrix. For example, x = [1 2 3 
4|,X = X-1 produces the output 


 —— 
1234 


a 
O12s 


As discussed in the chapter on linear algebra, multiplication of two matrices 
is only valid if the inner sizes of the matrices are equal. In other words, A*B 
is valid if the second size of A (number of columns) is the same as the first 
size of B (number of rows). Let a;,; represent the element of A in the 7" 
row and the j** column. Then the matrix A*B consists of elements 
Equation: 


(AB),,;=  — aikdy,j 


where n is the number of columns of A and the number of rows of B. Try 
typingA = [1 2 3;4 5 6];B = [7;8;9]; A*B. You should get 
the result 


ans = 
50 
112 


The inner product between two column vectors X and y is the scalar defined 
as the product x ' *y or equivalently as y' *X For example, X = 
[1;2],y = [3;4],x'*y, leads to the result 


ans = 
11 


Similarly, for row vectors the inner product is defined as x*y'. The 
Euclidean norm of a vector is defined as the square root of the inner product 
between a vector and itself. Try to compute the norm of the vector [1 2 3 
4]. You should get 5.4772. The outer product of two column (row) 
vectors is the matrix x*y' (x'*y). 


Any scalar can multiply or be multiplied by a matrix. The multiplication is 
then performed element by element. TryA = [1 2 3;4 5 6;7 8 
9];A%*2. You should get 


ans = 
2 4 6 
8 10 12 
14 26 28 


Verify that 2*A gives the same result. 


The inverse of a matrix is computed by using the function inv(A) and is 
only valid if A is square. If the matrix is singular, meaning that it has no 
inverse, a message will appear. Try typing 1nv(A). You should get 


Warning: Matrix is close to singular or badly scal 
ed. 
Results may be inaccurate. RCOND=2.937385e-18 


ans = 
1.0e+16* 
0.3152 -0.6304 O38 152 
-0.6304 1.2609 -0.6304 
0.3152 -0.6304 0.3152 


The inverse of a matrix may be used to solve a linear system of equations. 
For example, to solve the system 


you could type A> = [12 372 =2 4°00 -2 1) b — [27773]; 
inv(A)”*b and get 


ans = 
1 
-1 
1 
Equation: 
12 3 Ly 2 
1 -2 4 LQ 0 >= 1 ; 
0 -—2 1 £3 3 


Check to see that this is the correct answer by typing A* [1; -1;1]. What 
do you see? 


MATLAB offers another way to solve linear systems, based on Gauss 
elimination, that is faster than computing the inverse. The syntax is A\b 
and is valid whenever A has the same number of rows as D. Try it. 


The “Dot” Operator. Sometimes you may want to perform an operation 
element by element. In MATLAB, these element-by-element operations are 


called array operations. Of course, matrix addition and subtraction are 
already element-by-element operations. The operation A. *B denotes the 
multiplication, element by element, of the matrices A and B. Make two 3 
x3 matrices, A and B, and try 


> A*B 
> A.*B 


Suppose we want to find the square of each number in A. The proper way to 
specify this calculation is 


> A_squared=A.42 


where the period (dot) indicates an “array operation” to be performed on 
each element of A. Without the dot, A is multiplied by A according to the 
rules of matrix multiplication described in Chapter 4, giving a totally 
different result: 


> ANr2 


Subtleties. Because MATLAB can do so many different mathematical 
functions with just a few keystrokes, there are times when a very slight 
change in what you type will lead to a different result. Using the matrix A 
entered earlier, type the following two lines: 


> 2.4A 
> 2 .AA %with a space after 
the 2. 


In the first case, the dot is “absorbed” by the 2 as a decimal point and the “ 
is taken as a matrix exponential. But, when the dot is separated from the 2 
by a space, it becomes part of the operator (. *) and specifies that 2 should 
be raised to the power of each element in A. The point is, you should be 
very careful to type what you mean in an unambiguous way until you are 
familiar enough with MATLAB to know how the subtle situations will be 
interpreted. An unambiguous way of typing the preceding lines is 


> (2.)4A %for matrix 
exponential 
> (2).4A %for array 
exponential. 
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You can use the colon several ways in MATLAB (see help :). Its basic 
meaning is a vector of sequential values. For example, type 


> xX = 3:9 
to get 


xX = 
3456789 


For increments other than 1, use statements like 


> x = 1:0.5:4 
> x = 6:-1:0 


Most MATLAB functions will accept vector inputs and produce vector out- 
puts. The statement 


> y = sqrt(1:10) 


builds a vector of integers from 1 to 10 and takes the square root of each of 
those numbers. Try it. 


Now for another subtlety—what is the effect of each of the following 
statements and why? 


> 141:5 
>> 1+(1:5) 


Appending to a Matrix or Vector. A matrix or vector can be enlarged in 
size by appending new values to the old values. Let x=[1 3 5]: 
Equation: 


x = [x6810] > x = [1356810] 
13 5 6 8 10 


¥ = leo) Sy = (ee ae el 
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In "Complex Numbers", you learn that complex variables can be 
represented as points in the plane. MATLAB makes it easy for you to plot 
complex variables in a graph. Type Z1 = 1+.5*j;plot(zi,'o'). 
The graph window should be activated and the point Z1 displayed by a 'o'. 
You must specify the symbol for display, and the authorized symbols for 
point display are'.', 'O', ‘+', and '*'. When you are displaying a 
curve (to come later), no type is necessary. MATLAB automatically adjusts 
the scale on a graph to accommodate the value of the point being plotted. In 
this case, the range is [0, 2] for the real part and [0, 1] for the imaginary 
part. 


Let's now plot a second complex number by typing Z2 = 2+1.5%*j; 
plot(za, 'o' ). Note that the second plot command erases the first 
plot and changes the scaling to [0, 4] and [0,3]. Sometimes you may want 
to have the points plotted on the same graph. To achieve this, you have to 
use the command hold on after the first plot. Try the following: 


> plot(z2,'0') 
> hold on 
> plot(zi,'o') 
> hold off 


The advantage in using the hold command is that there is no limit to the 
number of plot commands you can type before the hold is turned off, and 
these plots may involve the same variable plotted over a range of values. 


You can also use different point displays. A disadvantage of the hold 
command is that the scaling is enforced by the first plot and is not adjusted 
for subsequent plots. This is why we plot the point Z2 first. Try reversing 
the order of the plots and see what happens. This means that points outside 
the scaling will not be displayed. The command hold off permits 
erasing the current graph for the next plot command. 


You can freeze the scaling of the graph by using the command axis. 
MATLAB gives you the message 


Axis scales frozen 
ans = 
040 3 


This freezes the current axis scaling for subsequent plots. Similarly, if you 
type axis a second time, MATLAB resumes the automatic scaling feature 
and prints the message 


Axis scales auto-ranged 
ans = 
040 3 


The axis scaling can also be manually enforced by using the command 
>> axis([xmin xmax ymin ymax] ) 


where (Xmin, ymin) is the lower left corner and (xmax, ymax) is 
the upper right corner of the graph. This scaling remains in effect until the 
next AX1S command is entered (with or without arguments). 


Another way to plot several complex numbers on the same graph is to 
display them as a curve. For this purpose, you have to store the numbers in 
a vector. For example, type Z1 = z1,z(2) = z2,plot(z). Note that 
the two points are at the two extremes of the line plotted on the graph. If 
you specify a symbol, then no line is drawn, just the extreme points. Try 
DGrZ Ors) . 


If you examine your current graph carefully, you will notice that the unit 
lengths on the x and y axes are not quite the same. In fact, MATLAB adjusts 
the length of an axis to conform to the overall size of the graph window. 
What this means is that a 45° line will actually be displayed at an angle 
depending on the overall aspect ratio of the graph window. To ensure that 
the aspect ratio is equal to 1, you may enter the command 
axis('square' ). MATLAB will then enforce an aspect ratio equal to 1, 
regardless of the aspect ratio for the outside graph window. This ensures 
that circles appear as circles and not as ellipses. MATLAB will make the 
square graph as large as possible to fit within the graph window. To go back 
to the default ratio, just type in axis('normal' ). 


To add labels to your graph, the functions xlLabel( 'text'), 
ylabel('text'), andtitle('text' ) are useful and self- 
explanatory. The argument text contains a string of characters. Add the 
label Real on the horizontal axis and the label Imaginary on the vertical 
axis of your graph. The command grid draws a grid on your graph. The 
grid does not remain in effect for the next plots. Try it. 


The plot Instruction. The plot instruction in MATLAB is very versatile. It 
can be used to plot several different types of data. Its syntax is 
plot(x,y,' symbol and/or color') orplot(y, 'symbol 
and/or color' ). The instruction will plot a vector of data versus 
another vector of data. The first vector is referenced to the horizontal axis 
and the second to the vertical axis. If only one vector is used, then it is 
plotted with reference to the vertical axis while the horizontal axis is 
automatically forced to be the index of the vector for the corresponding data 
point. The notation inside the apostrophes is optionally used to designate 
whether each element of the vector is to be plotted as a single point with a 
certain symbol or as a curve with a straight line drawn between each data 
value. The colors can also be specified. Possible symbols are (*,0,+, . ), 
and colors are (r, 9, 0, W) (red, green, blue, and white). Complex valued 
vectors are plotted by making the horizontal axis the real part of the vector 
and the vertical axis the imaginary part. Warning: A complex valued vector 
will automatically be plotted correctly on the complex plane (instead of real 
versus imaginary) only if every element of the vector is complex valued. 
Try POE OG 2). PLOENCG ). Bloeity, ))), plot 


(7 land plot (z=), prot GReal(Z),. imag 
(Zz), '*' ) to clarify your understanding of plot. Use 
x = [1357], y = [2468),z =[1+ 7,2 + 27,3 + 3]]. 


We may summarize as follows: 


plot(x, '*r') 


plot(y) 


plot(x,y, '0g') 
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values of 


x 
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(line—connected curve 
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y 


on vertical and the value 
of 


x 
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(line—connected curve 
of the value of 


y 


on vertical and the value 
of 


plot(x,y, '0g') 


plot(real(z),imag(z) ) 


plot(real(z),imag(z),'+b' ) 


x 
on horizontal) 


(circle—points of the 
value of 


y 


on vertical and the value 
of 


x 

on horizontal) 
(line—connectedplot of 
Z 

on the complex plane) 
(blue plus—points of 

Z 


on the complex plane). 


subplot( 223) at 


Subplot( 2 12) 7 


Subplots. It is possible to split the graphics screen up into several separate 
smaller graphs rather than just one large graph. As many as four subplots 
can be created. The MATLAB instruction Subplot (mnp) signifies 
which of the smaller graphs is to be accessed with the next plot statement. 
The mnp argument consists of three digits. The m and n are the numbers of 
rows (m) and columns (nN) into which the screen should be divided. The p 
designates which of the matrix elements is to be used. For example, 


subpiot( 1 21) 


Help and Demos. MATLAB has on-line help and a collection of 
demonstrations. For a list of available functions, type 


> help 
For help on a specific function, Sin for example, type 
> help sin 


To learn how to use colon (;, a very important and versatile character) in 
MATLAB, type 


> help 


The demos will also help you become more familiar with MATLAB and its 
capabilities. To run them, type 


> demo 
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If you quit MATLAB now, all the commands you have typed will be lost. 
This is where the Edit window is useful. If you choose new from the File 
menu, a new window appears with the title Edit 1: Untitled. In this window, 
you should type in all the commands you would like MATLAB to execute 
at once. When you are finished typing, you may save the file by choosing 
save or save as in the file menu and by entering a name for the file. If the 
edit window is active (that is, if it appears in front), then choose save and 
go from the file menu to save the file and execute it. If the command 
window is active, then you can execute the file by entering its name. 


Editing Files. To test your understanding of file editing, enter the following 
commands in a file named myfile: 


clear, clg %Clear variables a 

nd graphics 

j=sqrt(-1); %TO be sure 

Z1=1+ .5*j),Z2=2+1.5*j %Enter variables 

Z23=Z1+22, Z4=Z1* 22 %COompute sum and p 

roduct 

axis([0 4 0 4]), %First plot 
axis('square'),plot(z1,'o') 

hold on %Allow overplot 


plot(z2,'0o'),plot(z3,'+'), %Other plots 


plot(z4,'*') 
hold off 


You do not have to type the % sign and the text that follows it. These are 
simply comments in a file. They are ignored by the MATLAB interpreter. 
You should, however, make a habit of adding comments (preceded by %) to 
your file if you want to be able to understand programs that have been 
written long ago. 


Do not forget to save your file. Such a file is called a script file. It contains 
MATLAB commands that could have been entered one by one in the 
command window. You have three ways to execute a script file: 


i. with the edit window active, choose save and go from the file menu; 

ii. with the command window active, enter the file name; or 

iii. with the command window active, choose run script... from the M-file 
menu. In this case, a menu pops up to ask you which file you want to 
execute. 


mie 


Try each of these three methods in order to get used to their differences. 
Figure 1 shows the plot that you should get. 


Creating Functions. MATLAB puts many commands at your disposal, and 
you just have to enter their names (with or without arguments) to execute 
them. Some commands are built in to MATLAB, and others are contained 
in files to which you have access (not to modify them, but to see how things 
are done). You can try to display the contents of a command file in the 
command window by entering the command type filename. Enter type plot. 
You should get the message 


2??? Built-in function. 
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This means that the command plot is a build-in function and thus cannot 
be typed by the user (it is compiled with the program). Similarly, try to type 
the function file containing the command sinh by entering the command 
type sinh. You should get 


function y=sinh(x) 
%SINH SINH(X) is the hyperbolic sine of the elemen 
ts of X. 


y=(exp(x)-exp(-x) )/2; 


This is a typical example of a function file. It differs from a script file by the 
fact that the first line contains the word Function. What it does is make 
the command y = sSinh(x) equivalent to the command y=(exp(x) - 
exp(-x))/2.The variable x is the input argument (there could be more 
than one), and y is the output argument (there also could be more than one). 
The second line contains comments about the function and its arguments. 
They can spread to several lines. Display them by typing the command 
help sinh. The third line contains the actual function commands (there 
could also be several lines). 


As an exercise, enter the function per p(x): 


function y=perp(x) 

%PERP PERP(X) is a complex value perpendicular to 
XxX. 

j=sqrt(-1); 

y=j*real(x)-imag(x); 


Evaluate perp on various complex numbers. Replace the last line by y = 
x*J;. Do you get the same result? Why? 


Note that local variables are just that—local. Only the input and out- put 
arguments are kept in memory after the function is called and executed. For 
more on functions and multiple arguments, see the MATLAB manual. In 
the previous example, the variable 7 is local. If, before using the function 
perp, you use the same variable 7, its contents will not be affected by the 
command perp(Z). Verify this. 


Normally, while a file (script or function) is executed, the commands are 
not displayed on the command window. Just the results are displayed. The 
command echo allows you to view all the instructions. This is useful for 
debugging and demonstrations. It is turned off by typing echo off. 


An Introduction to MATLAB: Editing Files and Creating Functions (PC) 


Note:This module is part of the collection, A First Course in Electrical 
and Computer Engineering. The LaTeX source files for this collection 
were created using an optical character recognition technology, and 
because of this process there may be more errors than usual. Please contact 
us if you discover any errors. 


You should write a MATLAB program whenever you anticipate exe- cuting 
some sequence of statements several times or again in a later session. On an 
IBM PC, you may use any text editor to write a program, as long as the file 
can be saved in ASCII format without the control codes used by most word 
processors. Applicable text editors include Edix, Wordstar, X Tree Pro, and 
Turbo Pascal's editor. 


Editing Files. If you have enough memory, you can run your editor without 
leaving MATLAB by using the exclamation point (!), like this: 


>> !EDIX 


The exclamation point may be used to execute any DOS command or 
program from MATLAB. When the command or program finishes, your 
MATLAB variables are just as you left them. Use your editor to write 
program lines just as you would type them in MATLAB's command mode. 
Then save the file with extension .m in the directory where you will run 
MATLAB. Such MATLAB programs are called m-files. You may run your 
m-file by typing the file name (without the .m extension) at the MATLAB 
command prompt (>). 


Script Files. There are two kinds of m-files, called script files and 
functions. Running a script file is exactly like typing the commands it 
contains at the >> prompt. Your m-file will automatically be a script file 
unless you specify otherwise, as described later. Practice by entering, 
saving, and running plotsin.m as listed next: 


t 


-6:.2:6,; 

y = sin(t),; 
plot(t,y) 
title('SINE' ) 
pause 

grid 

Xlabel('t') 
ylabel('sin(t)') 


When the pause is executed, you will need to press a key to go on. If you 
type whos after running plotsin, you will see that the variables t and y 
remain in memory. Comments are important to a script file. They are 
marked with the symbol %. Anything following this symbol on a line is 
assumed to be a comment and is ignored by the MATLAB program 
interpreter. 


Functions. Functions differ from script files in that they have designated 
input and output variables. Any other variables used within a function are 
local variables, which do not remain after the function terminates and which 
have no effect on variables outside the function. Many of the functions 
supplied with MATLAB are actually m-files. A good example is triu.m: 


> type triu 


The word function at the beginning of the file makes it a function rather 
than a script file. The function name in this line must match the file name. 
The input variables of tr iu are X and k, meaning that the first input 
argument will be referred to as X and the second as k within the function. 
Likewise, the function line designates y as the output. There is nothing 
special about the variable names x,k, and y when the function is used. It is 
only that whatever inputs and output you use will be referred to as x,k, and 
y inside the function. The variables m,n,j, and 1 are created temporarily 
when tru runs and disappear when it terminates. They are local variables 
and have no effect on variables with the same names outside the function. 
In contrast, a script file has no local variables and does no substitution of 
input and output variable names. 


As an exercise, enter and save the function perp.m: 


function y=perp(x) 

% PERP(X) is a complex number perpendicular 
to xX. 

j 

y 


sqrt(-1); 
j*real(x)-imag(x); 


Evaluate perp on various complex numbers. Replace the last line by y = 
x*j;.Do you get the same result? Why? 


Printing Files and Graphics. To display an m-file on the screen, use the 
instruction >> type filename. To make a copy at a printer, use the 
DOS command > !print filename.m. Graphics hardcopy is 
available through the commands meta and gpp. See the MATLAB manual 
for more information. 


An Introduction to MATLAB: Loops and Control 


Note:This module is part of the collection, A First Course in Electrical 
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MATLAB has control statements like those in most computer languages. 
We will only study the for loop here. See the MATLAB manual for details 
on if and while statements. 


What for loops do is allow a statement or a group of statements to be 
repeated. For example, 


for 1 = 1:n,x(1) = 0,end 


assigns the value 0 to the first n elements of the array x. If Nn is less than 1, 
the instruction is still valid but nothing will be done. If n is not defined, 
then the following message will appear: 


2??? Undefined function or variable. 
Symbol in question==>n 


If n contains a real value, the integer part of n is used. If nN is a complex 
number, the integer part of the real part is taken. (This should, however, be 
avoided.) If X is not declared earlier or is of smaller dimension than n, then 
extra memory is allocated. 


The command end must follow the command for. If it is not present, 
MATLAB will wait for remaining statements until you type the command 
end, and nothing will be executed in the meantime. 


More than one statement may be included in the loop. The statement 1: Nn is 
the way to specify all the integer values between 1 and n. A step different 
than 1 may be specified. For example, the statement For 
1=1:2:5,x(1)=1, end is equivalent to x(1)=1, x(5)=1. Negative 
steps are also allowed, as in 1=n: -1: 1. 


We may use a fOr loop to draw a circle of radius 1. Type 


> j=sqrt(-1); 

> n=360; 

> for i=1:n,circle(1)=exp(2*j*i*pi/n);end; 
>plot(circle) 


Note how easy it is to plot a curve. But also note how slowly the for loop is 
executed. MATLAB is not very good at executing things one by one. It 
prefers, by far, a vector-oriented statement. Using the range specification as 
in the for loop, it is possible to speed up the process by replacing the 
explicit for loop by an implicit For loop using the colon, like this: 


> circle = exp((2*j*pi/n)*[1:n]); 


Note how much faster this command is executed. In general, For loops 
should be avoided as much as possible. For example, the first for loop you 
typed could have been replaced by the command x=zeros(1,n);, which 
is much more efficient. The function Zeros fills the variable with 0's to the 
specified size. Similarly, the function ones fills the variable with 1's. The 
size can also be determined by the size of the argument. If A is a matrix of 
size m, n, then the command B=ones (a) fills the matrix B with 1's and 
forces the matrix B to have exactly the same size as the matrix A. 


Avoiding for Loops. Since for loops are very inefficient in MATLAB 
(they are computed sequentially, adding several more computations for 
every loop), it is preferable to use the matrix capabilities of MATLAB to 
replace for loops and speed up processing time. 


i. Replace the for loop 
Oldie Ore ee ema 


il. 


ili. 


iv. 


with 
> xX 
to get 


1310 


X= [le2 3 45 6 7-8 9 10] 
Replace the for loop 


1:10, > x(i) = 


>> z = something; > for i 

Z*1; > end: 

with 

> zZ = something > x = z*(1:10); 
to get 

MS P2922 SZ, Z| 


Replace the for loop 
something; > x(1) = z; > for i = 2:10. 


> Z 


>> x(i) 


with 
> Z 
to get 


KS PZ 


= z*x(1-1); > end; 


something; > x = z.4(1:10); 


Replace the for loop 


= Tor 1 = O0:25100, > x(1) = 


ZNO) 


iO 7exp( 4) 2- pay a7 100). end: 


with 
> xX 
to get 


iG exp 2 pis (022: 1007 100); 


xX = [exp(0) exp(j*4*p1i/100) 
exp(j*200*pi/100) | 


The Edix Editor 
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On a PC, creating and editing m-files is done outside of MATLAB. The 
Edix editor is one of the visual types of editors. It can be accessed either 
from DOS by using > edix filename.m or from MATLAB by using 
> !edix filename.m. The latter will return you back to MATLAB 
after you quit the editor. MATLAB script files must end with a .m 
extension. They are also called m-files. Once in the Edix editor, a help list 
of the editor's commands can be displayed on the screen by typing “Alt-h.” 
(Note: The “Alt” key must be held down while the "h" key is depressed.) 
Pressing the “space” key will get you out of the help system. 


A new file is created by calling the Edix editor with a file name that has not 
been previously used. If the file name already exists on disc, then that file is 
retrieved and displayed on the screen and is ready to be edited. Since Edix 
is a visual editor, changes are made by moving the cursor to wherever 
something needs to be typed in or deleted. Once changes have been made to 
a file, then that file needs to be saved (written to disc) by typing “Alt-w.” 
When all editing is finished, you may exit from the Edix editor by typing 
“Alt-x.” Following are some useful Edix editor commands: 


Alt- help-list of all the Edix commands; 
h(F7) 


saves (writes) the file to disc; 


exits the Edix editor; 

deletes the current line; 

the first time used, marks the beginning of a block of lines 
to be edited; the second time, marks the end of a block of 
lines to be edited and highlights this block in white; the 
third time, "un-marks" the highlighted block; 


makes a copy of the highlighted block at the current cursor 
location (can be repeated); 


(moves the highlighted block to the current cursor location 
(can be repeated); 


put the cursor at the beginning of the line; and 


puts the cursor at the end of the line. 


Useful Mathematical Identities 


Note:This module is part of the collection, A First Course in Electrical 
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eJ? cos 0, and sin 0 


e® =limy 0 (1+ 52)" = O25 456)” =cos 6 + j sin 6 
00 —1)" jan _- oo n 
COS. =) a0 yet) = 24 oor? sis 
OS) ihe Ny eet 


67". sin 6 = 
(2n)! ap (2+ 1)! 


cos 0 = 
n=0 


Trigonometric Identities 


Equation: 
sin? 0+ cos? 6=1 
Equation: 
sin (9+ y) =sin 6 cos y+ cos 6 sin y 
Equation: 
cos (9+ y) =cos 6 cos y— sin @ sin y 
Equation: 


sin (0 — y) =sin 6 cos y— cos 8 sin y 


Equation: 


cos (0 — y) =cos 6 cos y+ sin @ sin y 


Euler's Equations 


Equation: 
e)” —cos 6 + j sin 6 
Equation: 
eI? — eI? 
sin 9 = : 
2j 
Equation: 
e+e I 
cos 6 = : 


De Moivre's Identity 
Equation: 


(cos 6+ 7 sin 0)” =cos né + j sin nO 


Binomial Expansion 


Equation: 


Equation: 


Geometric Sums 


Equation: 


Equation: 


Taylor's Series 


Equation: 


Equation: 


N 
= ~ 
n=0 uy 
az* = —— |z|<1 
k=0 
a a(1— 2% 
az* = ( Ni 
‘an oe 
f(e)= f(a 229 
= k! 


(Maclaurin’s Series if a = 0) 


